Back to Search Start Over

Use of diverse electronic medical record systems to identify genetic risk for type 2 diabetes within a genome-wide association study.

Authors :
Kho, Abel N.
Hayes, M. Geoffrey
Rasmussen-Torvik, Laura
Pacheco, Jennifer A.
Thompson, William K.
Armstrong, Loren L.
Denny, Joshua C.
Peissig, Peggy L.
Miller, Aaron W.
Wei, Wei-Qi
Bielinski, Suzette J.
Chute, Christopher G.
Leibson, Cynthia L.
Jarvik, Gail P.
Crosslin, David R.
Carlson, Christopher S.
Newton, Katherine M.
Wolf, Wendy A.
Chisholm, Rex L.
Lowe, William L.
Source :
Journal of the American Medical Informatics Association; Mar2012, Vol. 19 Issue 2, p212-218, 7p
Publication Year :
2012

Abstract

Objective Genome-wide association studies (GWAS) require high specificity and large numbers of subjects to identify genotype-phenotype correlations accurately. The aim of this study was to identify type 2 diabetes (T2D) cases and controls for a GWAS, using data captured through routine clinical care across five institutions using different electronic medical record (EMR) systems. Materials and Methods An algorithm was developed to identify T2D cases and controls based on a combination of diagnoses, medications, and laboratory results. The performance of the algorithm was validated at three of the five participating institutions compared against clinician review. A GWAS was subsequently performed using cases and controls identified by the algorithm, with samples pooled across all five institutions. Results The algorithm achieved 98% and 100% positive predictive values for the identification of diabetic cases and controls, respectively, as compared against clinician review. By standardizing and applying the algorithm across institutions, 3353 cases and 3352 controls were identified. Subsequent GWAS using data from five institutions replicated the TCF7L2 gene variant (rs7903146) previously associated with T2D. Discussion By applying stringent criteria to EMR data collected through routine clinical care, cases and controls for a GWAS were identified that subsequently replicated a known genetic variant. The use of standard terminologies to define data elements enabled pooling of subjects and data across five different institutions to achieve the robust numbers required for GWAS. Conclusions An algorithm using commonly available data from five different EMR can accurately identify T2D cases and controls for genetic study across multiple institutions. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10675027
Volume :
19
Issue :
2
Database :
Complementary Index
Journal :
Journal of the American Medical Informatics Association
Publication Type :
Academic Journal
Accession number :
71961552
Full Text :
https://doi.org/10.1136/amiajnl-2011-000439