23 results on '"Vy HMT"'
Search Results
2. Ancestrally and Temporally Diverse Analysis of Penetrance of Clinical Variants in 72,434 Individuals
- Author
-
Ruth J. F. Loos, Judy H. Cho, Ron Do, Daniel M. Jordan, Bafna S, Ghislain Rocheleau, Iain S. Forrest, Vy Hmt, and Kumardeep Chaudhary
- Subjects
Genetics ,education.field_of_study ,Population ,Familial hypercholesterolemia ,Disease ,Biology ,medicine.disease ,Biobank ,Penetrance ,Biomarker (cell) ,Breast cancer ,medicine ,education ,Exome - Abstract
A major goal of genomic medicine is to quantify the disease risk of genetic variants. Here, we report the penetrance of 37,772 clinically relevant variants (including those reported in ClinVar1 and of loss-of-function consequence) for 197 diseases in an analysis of exome sequence data for 72,434 individuals over five ancestries and six decades of ages from two large-scale population-based biobanks (BioMe Biobank and UK Biobank). With a high-quality set of 5,359 clinically impactful variants, we evaluate disease prevalence in carriers and non-carriers to interrogate major determinants and implications of penetrance. First, we associate biomarker levels with penetrance of variants in known disease-predisposition genes and illustrate their clear biological link to disease. We then systematically uncover large numbers of ClinVar pathogenic variants that confer low risk of disease, even among those reviewed by experts, while delineating stark differences in variant penetrance by molecular consequence. Furthermore, we ascertain numerous variants present in non-European ancestries and reveal how increasing carrier age modifies penetrance estimates. Lastly, we examine substantial heterogeneity of penetrance among variants in known disease-predisposition genes for conditions such as familial hypercholesterolemia and breast cancer. These data indicate that existing categorical systems for variant classification do not adequately capture disease risk and warrant consideration of a more quantitative system based on population-based penetrance to evaluate clinical impact.
- Published
- 2021
3. Use of Diagnostic Codes for Primary Open-Angle Glaucoma Polygenic Risk Score Construction in Electronic Health Record-Linked Biobanks.
- Author
-
Tran JH, Kang J, Han E, Gupta U, Seresirikachorn K, Vy HMT, Zhao Y, Rocheleau G, Luo Y, Lee R, Do R, Friedman DS, Kang JH, Wiggs JL, Pasquale LR, Segrè AV, and Zebardast N
- Subjects
- Humans, Retrospective Studies, Male, Female, Aged, Middle Aged, Biological Specimen Banks, Risk Factors, International Classification of Diseases, Visual Fields physiology, Multifactorial Inheritance, Area Under Curve, Tomography, Optical Coherence, Genome-Wide Association Study, Risk Assessment methods, ROC Curve, Predictive Value of Tests, Genetic Risk Score, Glaucoma, Open-Angle genetics, Glaucoma, Open-Angle diagnosis, Electronic Health Records, Intraocular Pressure physiology
- Abstract
Purpose: Polygenic risk scores (PRSs) likely predict risk and prognosis of glaucoma. We compared the PRS performance for primary open-angle glaucoma (POAG), defined using International Classification of Diseases (ICD) codes vs manual medical record review., Design: Retrospective cohort study., Methods: We identified POAG cases in the Mount Sinai BioMe and Mass General Brigham (MGB) biobanks using ICD codes. We confirmed POAG based on optical coherence tomograms and visual fields. In a separate 5% sample, the absence of POAG was confirmed with intraocular pressure and cup-disc ratio criteria. We used genotype data and either self-reported glaucoma diagnoses or ICD-10 codes for glaucoma diagnoses from the UK Biobank and the lassosum method to compute a genome-wide POAG PRS. We compared the area under the curve (AUC) for POAG prediction based on ICD codes vs medical records., Results: We reviewed 804 of 996 BioMe and 367 of 1006 MGB ICD-identified cases. In BioMe and MGB, respectively, positive predictive value was 53% and 55%; negative predictive value was 96% and 97%; sensitivity was 97% and 97%; and specificity was 44% and 53%. Adjusted PRS AUCs for POAG using ICD codes vs manual record review in BioMe were not statistically different (P ≥.21) by ancestry: 0.77 vs 0.75 for African, 0.80 vs 0.80 for Hispanic, and 0.81 vs 0.81 for European. Results were similar in MGB (P ≥.18): 0.72 vs 0.80 for African, 0.83 vs 0.86 for Hispanic, and 0.74 vs 0.73 for European., Conclusions: A POAG PRS performed similarly using either manual review or ICD codes in 2 electronic health record-linked biobanks; manual assessment of glaucoma status might not be necessary for some PRS studies. However, caution should be exercised when using ICD codes for glaucoma diagnosis given their low specificity (44%-53%) for manually confirmed cases of glaucoma., (Copyright © 2024 Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
4. Exome sequence analysis identifies rare coding variants associated with a machine learning-based marker for coronary artery disease.
- Author
-
Petrazzini BO, Forrest IS, Rocheleau G, Vy HMT, Márquez-Luna C, Duffy Á, Chen R, Park JK, Gibson K, Goonewardena SN, Malick WA, Rosenson RS, Jordan DM, and Do R
- Subjects
- Humans, Exome genetics, Exome Sequencing methods, Genetic Variation, Genome-Wide Association Study methods, Female, Polymorphism, Single Nucleotide, Coronary Artery Disease genetics, Machine Learning, Genetic Predisposition to Disease
- Abstract
Coronary artery disease (CAD) exists on a spectrum of disease represented by a combination of risk factors and pathogenic processes. An in silico score for CAD built using machine learning and clinical data in electronic health records captures disease progression, severity and underdiagnosis on this spectrum and could enhance genetic discovery efforts for CAD. Here we tested associations of rare and ultrarare coding variants with the in silico score for CAD in the UK Biobank, All of Us Research Program and BioMe Biobank. We identified associations in 17 genes; of these, 14 show at least moderate levels of prior genetic, biological and/or clinical support for CAD. We also observed an excess of ultrarare coding variants in 321 aggregated CAD genes, suggesting more ultrarare variant associations await discovery. These results expand our understanding of the genetic etiology of CAD and illustrate how digital markers can enhance genetic association investigations for complex diseases., (© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2024
- Full Text
- View/download PDF
5. Time-to-Event Genome-Wide Association Study for Incident Cardiovascular Disease in People With Type 2 Diabetes.
- Author
-
Kwak SH, Hernandez-Cancela RB, DiCorpo DA, Condon DE, Merino J, Wu P, Brody JA, Yao J, Guo X, Ahmadizar F, Meyer M, Sincan M, Mercader JM, Lee S, Haessler J, Vy HMT, Lin Z, Armstrong ND, Gu S, Tsao NL, Lange LA, Wang N, Wiggins KL, Trompet S, Liu S, Loos RJF, Judy R, Schroeder PH, Hasbani NR, Bos MM, Morrison AC, Jackson RD, Reiner AP, Manson JE, Chaudhary NS, Carmichael LK, Chen YI, Taylor KD, Ghanbari M, van Meurs J, Pitsillides AN, Psaty BM, Noordam R, Do R, Park KS, Jukema JW, Kavousi M, Correa A, Rich SS, Damrauer SM, Hajek C, Cho NH, Irvin MR, Pankow JS, Nadkarni GN, Sladek R, Goodarzi MO, Florez JC, Chasman DI, Heckbert SR, Kooperberg C, Dupuis J, Malhotra R, de Vries PS, Liu CT, Rotter JI, and Meigs JB
- Subjects
- Humans, Female, Male, Middle Aged, Aged, Polymorphism, Single Nucleotide, Diabetes Mellitus, Type 2 genetics, Diabetes Mellitus, Type 2 epidemiology, Diabetes Mellitus, Type 2 complications, Genome-Wide Association Study, Cardiovascular Diseases genetics, Cardiovascular Diseases epidemiology
- Abstract
Objective: To identify genetic risk factors for incident cardiovascular disease (CVD) among people with type 2 diabetes (T2D)., Research Design and Methods: We conducted a multiancestry time-to-event genome-wide association study for incident CVD among people with T2D. We also tested 204 known coronary artery disease (CAD) variants for association with incident CVD., Results: Among 49,230 participants with T2D, 8,956 had incident CVD events (event rate 18.2%). We identified three novel genetic loci for incident CVD: rs147138607 (near CACNA1E/ZNF648, hazard ratio [HR] 1.23, P = 3.6 × 10-9), rs77142250 (near HS3ST1, HR 1.89, P = 9.9 × 10-9), and rs335407 (near TFB1M/NOX3, HR 1.25, P = 1.5 × 10-8). Among 204 known CAD loci, 5 were associated with incident CVD in T2D (multiple comparison-adjusted P < 0.00024, 0.05/204). A standardized polygenic score of these 204 variants was associated with incident CVD with HR 1.14 (P = 1.0 × 10-16)., Conclusions: The data point to novel and known genomic regions associated with incident CVD among individuals with T2D., (© 2024 by the American Diabetes Association.)
- Published
- 2024
- Full Text
- View/download PDF
6. Genome-first evaluation with exome sequence and clinical data uncovers underdiagnosed genetic disorders in a large healthcare system.
- Author
-
Forrest IS, Duffy Á, Park JK, Vy HMT, Pasquale LR, Nadkarni GN, Cho JH, and Do R
- Subjects
- Humans, Female, Male, Middle Aged, Adult, Genetic Diseases, Inborn genetics, Genetic Diseases, Inborn diagnosis, Genetic Diseases, Inborn epidemiology, Genetic Predisposition to Disease, Electronic Health Records, Genetic Testing methods, Genome, Human, Aged, Delivery of Health Care, Adolescent, Genomics methods, Young Adult, Exome genetics, Exome Sequencing methods
- Abstract
Population-based genomic screening may help diagnose individuals with disease-risk variants. Here, we perform a genome-first evaluation for nine disorders in 29,039 participants with linked exome sequences and electronic health records (EHRs). We identify 614 individuals with 303 pathogenic/likely pathogenic or predicted loss-of-function (P/LP/LoF) variants, yielding 644 observations; 487 observations (76%) lack a corresponding clinical diagnosis in the EHR. Upon further investigation, 75 clinically undiagnosed observations (15%) have evidence of symptomatic untreated disease, including familial hypercholesterolemia (3 of 6 [50%] undiagnosed observations with disease evidence) and breast cancer (23 of 106 [22%]). These genetic findings enable targeted phenotyping that reveals new diagnoses in previously undiagnosed individuals. Disease yield is greater with variants in penetrant genes for which disease is observed in carriers in an independent cohort. The prevalence of P/LP/LoF variants exceeds that of clinical diagnoses, and some clinically undiagnosed carriers are discovered to have disease. These results highlight the potential of population-based genomic screening., Competing Interests: Declaration of interests R.D. reported receiving grants from AstraZeneca and grants and nonfinancial support from Goldfinch Bio and being a scientific co-founder, consultant, and equity holder for Pensieve Health (pending) and a consultant for Variant Bio, all not related to this work. G.N.N. reported being a scientific co-founder, consultant, advisory board member, and equity owner of Renalytix AI; a scientific co-founder and equity holder for Pensieve Health (pending); and a consultant for Variant Bio and receiving grants from Goldfinch Bio and personal fees from Renalytix AI, BioVie, Reata, AstraZeneca, and GLG Consulting. L.R.P. is a consultant for Eyenovia, Twenty Twenty, and Skye Bioscience., (Copyright © 2024 The Author(s). Published by Elsevier Inc. All rights reserved.)
- Published
- 2024
- Full Text
- View/download PDF
7. Genome-Wide Polygenic Risk Score for CKD in Individuals with APOL1 High-Risk Genotypes.
- Author
-
Vy HMT, Coca SG, Sawant A, Sakhuja A, Gutierrez OM, Cooper R, Loos RJF, Horowitz CR, Do R, and Nadkarni GN
- Subjects
- Humans, Apolipoprotein L1 genetics, Genotype, Genetic Predisposition to Disease, Risk Factors, Genetic Risk Score, Renal Insufficiency, Chronic diagnosis, Renal Insufficiency, Chronic genetics
- Published
- 2024
- Full Text
- View/download PDF
8. A deep learning transformer model predicts high rates of undiagnosed rare disease in large electronic health systems.
- Author
-
Jordan DM, Vy HMT, and Do R
- Abstract
It is estimated that as many as 1 in 16 people worldwide suffer from rare diseases. Rare disease patients face difficulty finding diagnosis and treatment for their conditions, including long diagnostic odysseys, multiple incorrect diagnoses, and unavailable or prohibitively expensive treatments. As a result, it is likely that large electronic health record (EHR) systems include high numbers of participants suffering from undiagnosed rare disease. While this has been shown in detail for specific diseases, these studies are expensive and time consuming and have only been feasible to perform for a handful of the thousands of known rare diseases. The bulk of these undiagnosed cases are effectively hidden, with no straightforward way to differentiate them from healthy controls. The ability to access them at scale would enormously expand our capacity to study and develop drugs for rare diseases, adding to tools aimed at increasing availability of study cohorts for rare disease. In this study, we train a deep learning transformer algorithm, RarePT (Rare-Phenotype Prediction Transformer), to impute undiagnosed rare disease from EHR diagnosis codes in 436,407 participants in the UK Biobank and validated on an independent cohort from 3,333,560 individuals from the Mount Sinai Health System. We applied our model to 155 rare diagnosis codes with fewer than 250 cases each in the UK Biobank and predicted participants with elevated risk for each diagnosis, with the number of participants predicted to be at risk ranging from 85 to 22,000 for different diagnoses. These risk predictions are significantly associated with increased mortality for 65% of diagnoses, with disease burden expressed as disability-adjusted life years (DALY) for 73% of diagnoses, and with 72% of available disease-specific diagnostic tests. They are also highly enriched for known rare diagnoses in patients not included in the training set, with an odds ratio (OR) of 48.0 in cross-validation cohorts of the UK Biobank and an OR of 30.6 in the independent Mount Sinai Health System cohort. Most importantly, RarePT successfully screens for undiagnosed patients in 32 rare diseases with available diagnostic tests in the UK Biobank. Using the trained model to estimate the prevalence of undiagnosed disease in the UK Biobank for these 32 rare phenotypes, we find that at least 50% of patients remain undiagnosed for 20 of 32 diseases. These estimates provide empirical evidence of a high prevalence of undiagnosed rare disease, as well as demonstrating the enormous potential benefit of using RarePT to screen for undiagnosed rare disease patients in large electronic health systems., Competing Interests: Conflict of Interest Disclosures: Dr. Do reported receiving grants from AstraZeneca, grants and non-financial support from Goldfinch Bio, being a scientific co-founder, consultant and equity holder for Pensieve Health (pending), and being a consultant for Variant Bio, all not related to this study.
- Published
- 2023
- Full Text
- View/download PDF
9. Time-to-Event Genome-Wide Association Study for Incident Cardiovascular Disease in People with Type 2 Diabetes Mellitus.
- Author
-
Kwak SH, Hernandez-Cancela RB, DiCorpo DA, Condon DE, Merino J, Wu P, Brody JA, Yao J, Guo X, Ahmadizar F, Meyer M, Sincan M, Mercader JM, Lee S, Haessler J, Vy HMT, Lin Z, Armstrong ND, Gu S, Tsao NL, Lange LA, Wang N, Wiggins KL, Trompet S, Liu S, Loos RJF, Judy R, Schroeder PH, Hasbani NR, Bos MM, Morrison AC, Jackson RD, Reiner AP, Manson JE, Chaudhary NS, Carmichael LK, Chen YI, Taylor KD, Ghanbari M, van Meurs J, Pitsillides AN, Psaty BM, Noordam R, Do R, Park KS, Jukema JW, Kavousi M, Correa A, Rich SS, Damrauer SM, Hajek C, Cho NH, Irvin MR, Pankow JS, Nadkarni GN, Sladek R, Goodarzi MO, Florez JC, Chasman DI, Heckbert SR, Kooperberg C, Dupuis J, Malhotra R, de Vries PS, Liu CT, Rotter JI, and Meigs JB
- Abstract
Background: Type 2 diabetes mellitus (T2D) confers a two- to three-fold increased risk of cardiovascular disease (CVD). However, the mechanisms underlying increased CVD risk among people with T2D are only partially understood. We hypothesized that a genetic association study among people with T2D at risk for developing incident cardiovascular complications could provide insights into molecular genetic aspects underlying CVD., Methods: From 16 studies of the Cohorts for Heart & Aging Research in Genomic Epidemiology (CHARGE) Consortium, we conducted a multi-ancestry time-to-event genome-wide association study (GWAS) for incident CVD among people with T2D using Cox proportional hazards models. Incident CVD was defined based on a composite of coronary artery disease (CAD), stroke, and cardiovascular death that occurred at least one year after the diagnosis of T2D. Cohort-level estimated effect sizes were combined using inverse variance weighted fixed effects meta-analysis. We also tested 204 known CAD variants for association with incident CVD among patients with T2D., Results: A total of 49,230 participants with T2D were included in the analyses (31,118 European ancestries and 18,112 non-European ancestries) which consisted of 8,956 incident CVD cases over a range of mean follow-up duration between 3.2 and 33.7 years (event rate 18.2%). We identified three novel, distinct genetic loci for incident CVD among individuals with T2D that reached the threshold for genome-wide significance ( P <5.0×10
-8 ): rs147138607 (intergenic variant between CACNA1E and ZNF648 ) with a hazard ratio (HR) 1.23, 95% confidence interval (CI) 1.15 - 1.32, P =3.6×10-9 , rs11444867 (intergenic variant near HS3ST1 ) with HR 1.89, 95% CI 1.52 - 2.35, P =9.9×10-9 , and rs335407 (intergenic variant between TFB1M and NOX3 ) HR 1.25, 95% CI 1.16 - 1.35, P =1.5×10-8 . Among 204 known CAD loci, 32 were associated with incident CVD in people with T2D with P <0.05, and 5 were significant after Bonferroni correction ( P <0.00024, 0.05/204). A polygenic score of these 204 variants was significantly associated with incident CVD with HR 1.14 (95% CI 1.12 - 1.16) per 1 standard deviation increase ( P =1.0×10-16 )., Conclusions: The data point to novel and known genomic regions associated with incident CVD among individuals with T2D.- Published
- 2023
- Full Text
- View/download PDF
10. Genome-wide association study of thoracic aortic aneurysm and dissection in the Million Veteran Program.
- Author
-
Klarin D, Devineni P, Sendamarai AK, Angueira AR, Graham SE, Shen YH, Levin MG, Pirruccello JP, Surakka I, Karnam PR, Roychowdhury T, Li Y, Wang M, Aragam KG, Paruchuri K, Zuber V, Shakt GE, Tsao NL, Judy RL, Vy HMT, Verma SS, Rader DJ, Do R, Bavaria JE, Nadkarni GN, Ritchie MD, Burgess S, Guo DC, Ellinor PT, LeMaire SA, Milewicz DM, Willer CJ, Natarajan P, Tsao PS, Pyarajan S, and Damrauer SM
- Subjects
- Humans, Genome-Wide Association Study, Pedigree, Veterans, Aortic Aneurysm, Thoracic genetics, Aortic Dissection genetics
- Abstract
The current understanding of the genetic determinants of thoracic aortic aneurysms and dissections (TAAD) has largely been informed through studies of rare, Mendelian forms of disease. Here, we conducted a genome-wide association study (GWAS) of TAAD, testing ~25 million DNA sequence variants in 8,626 participants with and 453,043 participants without TAAD in the Million Veteran Program, with replication in an independent sample of 4,459 individuals with and 512,463 without TAAD from six cohorts. We identified 21 TAAD risk loci, 17 of which have not been previously reported. We leverage multiple downstream analytic methods to identify causal TAAD risk genes and cell types and provide human genetic evidence that TAAD is a non-atherosclerotic aortic disorder distinct from other forms of vascular disease. Our results demonstrate that the genetic architecture of TAAD mirrors that of other complex traits and that it is not solely inherited through protein-altering variants of large effect size., (© 2023. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.)
- Published
- 2023
- Full Text
- View/download PDF
11. Polygenic prediction of preeclampsia and gestational hypertension.
- Author
-
Honigberg MC, Truong B, Khan RR, Xiao B, Bhatta L, Vy HMT, Guerrero RF, Schuermans A, Selvaraj MS, Patel AP, Koyama S, Cho SMJ, Vellarikkal SK, Trinder M, Urbut SM, Gray KJ, Brumpton BM, Patil S, Zöllner S, Antopia MC, Saxena R, Nadkarni GN, Do R, Yan Q, Pe'er I, Verma SS, Gupta RM, Haas DM, Martin HC, van Heel DA, Laisk T, and Natarajan P
- Subjects
- Pregnancy, Female, Child, Humans, Aspirin, Risk Factors, Hypertension, Pregnancy-Induced genetics, Pre-Eclampsia genetics, Pre-Eclampsia prevention & control, Eclampsia, Hypertension
- Abstract
Preeclampsia and gestational hypertension are common pregnancy complications associated with adverse maternal and child outcomes. Current tools for prediction, prevention and treatment are limited. Here we tested the association of maternal DNA sequence variants with preeclampsia in 20,064 cases and 703,117 control individuals and with gestational hypertension in 11,027 cases and 412,788 control individuals across discovery and follow-up cohorts using multi-ancestry meta-analysis. Altogether, we identified 18 independent loci associated with preeclampsia/eclampsia and/or gestational hypertension, 12 of which are new (for example, MTHFR-CLCN6, WNT3A, NPR3, PGR and RGL3), including two loci (PLCE1 and FURIN) identified in the multitrait analysis. Identified loci highlight the role of natriuretic peptide signaling, angiogenesis, renal glomerular function, trophoblast development and immune dysregulation. We derived genome-wide polygenic risk scores that predicted preeclampsia/eclampsia and gestational hypertension in external cohorts, independent of clinical risk factors, and reclassified eligibility for low-dose aspirin to prevent preeclampsia. Collectively, these findings provide mechanistic insights into the hypertensive disorders of pregnancy and have the potential to advance pregnancy risk stratification., (© 2023. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2023
- Full Text
- View/download PDF
12. Quantitative prediction of right ventricular and size and function from the electrocardiogram.
- Author
-
Duong SQ, Vaid A, Vy HMT, Butler LR, Lampert J, Pass RH, Charney AW, Narula J, Khera R, Greenspan H, Gelb BD, Do R, and Nadkarni G
- Abstract
Background: Right ventricular ejection fraction (RVEF) and end-diastolic volume (RVEDV) are not readily assessed through traditional modalities. Deep-learning enabled 12-lead electrocardiogram analysis (DL-ECG) for estimation of RV size or function is unexplored., Methods: We trained a DL-ECG model to predict RV dilation (RVEDV>120 mL/m
2 ), RV dysfunction (RVEF≤40%), and numerical RVEDV/RVEF from 12-lead ECG paired with reference-standard cardiac MRI (cMRI) volumetric measurements in UK biobank (UKBB; n=42,938). We fine-tuned in a multi-center health system (MSHoriginal ; n=3,019) with prospective validation over 4 months (MSHvalidation ; n=115). We evaluated performance using area under the receiver operating curve (AUROC) for categorical and mean absolute error (MAE) for continuous measures overall and in key subgroups. We assessed association of RVEF prediction with transplant-free survival with Cox proportional hazards models., Results: Prevalence of RV dysfunction for UKBB/MSHoriginal /MSHvalidation cohorts was 1.0%/18.0%/15.7%, respectively. RV dysfunction model AUROC for UKBB/MSHoriginal /MSHvalidation cohorts was 0.86/0.81/0.77, respectively. Prevalence of RV dilation for UKBB/MSHoriginal /MSHvalidation cohorts was 1.6%/10.6%/4.3%. RV dilation model AUROC for UKBB/MSHoriginal /MSHvalidation cohorts 0.91/0.81/0.92, respectively. MSHoriginal MAE was RVEF=7.8% and RVEDV=17.6 ml/m2 . Performance was similar in key subgroups including with and without left ventricular dysfunction. Over median follow-up of 2.3 years, predicted RVEF was independently associated with composite outcome (HR 1.37 for each 10% decrease, p=0.046)., Conclusions: DL-ECG analysis can accurately identify significant RV dysfunction and dilation both overall and in key subgroups. Predicted RVEF is independently associated with clinical outcome.- Published
- 2023
- Full Text
- View/download PDF
13. Machine Learning Identifies Plasma Metabolites Associated With Heart Failure in Underrepresented Populations With the TTR V122I Variant.
- Author
-
Park JK, Petrazzini BO, Saha A, Vaid A, Vy HMT, Márquez-Luna C, Chan L, Nadkarni GN, and Do R
- Subjects
- Humans, Machine Learning, Prealbumin genetics, Prealbumin metabolism, Mutation, Heart Failure diagnosis, Heart Failure genetics, Heart Failure complications, Amyloidosis metabolism, Cardiomyopathies metabolism, Amyloid Neuropathies, Familial complications
- Published
- 2023
- Full Text
- View/download PDF
14. Genome-wide association and multi-trait analyses characterize the common genetic architecture of heart failure.
- Author
-
Levin MG, Tsao NL, Singhal P, Liu C, Vy HMT, Paranjpe I, Backman JD, Bellomo TR, Bone WP, Biddinger KJ, Hui Q, Dikilitas O, Satterfield BA, Yang Y, Morley MP, Bradford Y, Burke M, Reza N, Charest B, Judy RL, Puckelwartz MJ, Hakonarson H, Khan A, Kottyan LC, Kullo I, Luo Y, McNally EM, Rasmussen-Torvik LJ, Day SM, Do R, Phillips LS, Ellinor PT, Nadkarni GN, Ritchie MD, Arany Z, Cappola TP, Margulies KB, Aragam KG, Haggerty CM, Joseph J, Sun YV, Voight BF, and Damrauer SM
- Subjects
- Humans, Phenotype, Heart, Gene Expression Profiling, Polymorphism, Single Nucleotide, Genetic Predisposition to Disease, Genome-Wide Association Study methods, Heart Failure genetics
- Abstract
Heart failure is a leading cause of cardiovascular morbidity and mortality. However, the contribution of common genetic variation to heart failure risk has not been fully elucidated, particularly in comparison to other common cardiometabolic traits. We report a multi-ancestry genome-wide association study meta-analysis of all-cause heart failure including up to 115,150 cases and 1,550,331 controls of diverse genetic ancestry, identifying 47 risk loci. We also perform multivariate genome-wide association studies that integrate heart failure with related cardiac magnetic resonance imaging endophenotypes, identifying 61 risk loci. Gene-prioritization analyses including colocalization and transcriptome-wide association studies identify known and previously unreported candidate cardiomyopathy genes and cellular processes, which we validate in gene-expression profiling of failing and healthy human hearts. Colocalization, gene expression profiling, and Mendelian randomization provide convergent evidence for the roles of BCKDHA and circulating branch-chain amino acids in heart failure and cardiac structure. Finally, proteome-wide Mendelian randomization identifies 9 circulating proteins associated with heart failure or quantitative imaging traits. These analyses highlight similarities and differences among heart failure and associated cardiovascular imaging endophenotypes, implicate common genetic variation in the pathogenesis of heart failure, and identify circulating proteins that may represent cardiomyopathy treatment targets., (© 2022. This is a U.S. Government work and not under copyright protection in the US; foreign copyright protection may apply.)
- Published
- 2022
- Full Text
- View/download PDF
15. Cross-Ancestry Investigation of Venous Thromboembolism Genomic Predictors.
- Author
-
Thibord F, Klarin D, Brody JA, Chen MH, Levin MG, Chasman DI, Goode EL, Hveem K, Teder-Laving M, Martinez-Perez A, Aïssi D, Daian-Bacq D, Ito K, Natarajan P, Lutsey PL, Nadkarni GN, de Vries PS, Cuellar-Partida G, Wolford BN, Pattee JW, Kooperberg C, Braekkan SK, Li-Gao R, Saut N, Sept C, Germain M, Judy RL, Wiggins KL, Ko D, O'Donnell CJ, Taylor KD, Giulianini F, De Andrade M, Nøst TH, Boland A, Empana JP, Koyama S, Gilliland T, Do R, Huffman JE, Wang X, Zhou W, Manuel Soria J, Carlos Souto J, Pankratz N, Haessler J, Hindberg K, Rosendaal FR, Turman C, Olaso R, Kember RL, Bartz TM, Lynch JA, Heckbert SR, Armasu SM, Brumpton B, Smadja DM, Jouven X, Komuro I, Clapham KR, Loos RJF, Willer CJ, Sabater-Lleal M, Pankow JS, Reiner AP, Morelli VM, Ridker PM, Vlieg AVH, Deleuze JF, Kraft P, Rader DJ, Min Lee K, Psaty BM, Heidi Skogholt A, Emmerich J, Suchon P, Rich SS, Vy HMT, Tang W, Jackson RD, Hansen JB, Morange PE, Kabrhel C, Trégouët DA, Damrauer SM, Johnson AD, and Smith NL
- Subjects
- Genetic Predisposition to Disease, Genome-Wide Association Study, Genomics, Humans, Polymorphism, Single Nucleotide, Quantitative Trait Loci, Thrombosis genetics, Venous Thromboembolism diagnosis, Venous Thromboembolism genetics
- Abstract
Background: Venous thromboembolism (VTE) is a life-threatening vascular event with environmental and genetic determinants. Recent VTE genome-wide association studies (GWAS) meta-analyses involved nearly 30 000 VTE cases and identified up to 40 genetic loci associated with VTE risk, including loci not previously suspected to play a role in hemostasis. The aim of our research was to expand discovery of new genetic loci associated with VTE by using cross-ancestry genomic resources., Methods: We present new cross-ancestry meta-analyzed GWAS results involving up to 81 669 VTE cases from 30 studies, with replication of novel loci in independent populations and loci characterization through in silico genomic interrogations., Results: In our genetic discovery effort that included 55 330 participants with VTE (47 822 European, 6320 African, and 1188 Hispanic ancestry), we identified 48 novel associations, of which 34 were replicated after correction for multiple testing. In our combined discovery-replication analysis (81 669 VTE participants) and ancestry-stratified meta-analyses (European, African, and Hispanic), we identified another 44 novel associations, which are new candidate VTE-associated loci requiring replication. In total, across all GWAS meta-analyses, we identified 135 independent genomic loci significantly associated with VTE risk. A genetic risk score of the significantly associated loci in Europeans identified a 6-fold increase in risk for those in the top 1% of scores compared with those with average scores. We also identified 31 novel transcript associations in transcriptome-wide association studies and 8 novel candidate genes with protein quantitative-trait locus Mendelian randomization analyses. In silico interrogations of hemostasis and hematology traits and a large phenome-wide association analysis of the 135 GWAS loci provided insights to biological pathways contributing to VTE, with some loci contributing to VTE through well-characterized coagulation pathways and others providing new data on the role of hematology traits, particularly platelet function. Many of the replicated loci are outside of known or currently hypothesized pathways to thrombosis., Conclusions: Our cross-ancestry GWAS meta-analyses identified new loci associated with VTE. These findings highlight new pathways to thrombosis and provide novel molecules that may be useful in the development of improved antithrombosis treatments.
- Published
- 2022
- Full Text
- View/download PDF
16. Genome-Wide Epistatic Interaction between DEF1B and APOL1 High-Risk Genotypes for Chronic Kidney Disease.
- Author
-
Vy HMT, Lin BM, Gulamali FF, Kooperberg C, Graff M, Wong J, Campbell KN, Matise TC, Coresh J, Thomas F, Reiner AP, Nassir R, Schnatz PF, Johns T, Buyske S, Haiman C, Cooper R, Loos RJF, Horowitz CR, Gutierrez OM, Do R, Franceschini N, and Nadkarni GN
- Subjects
- Humans, Genotype, Genetic Predisposition to Disease, Apolipoprotein L1 genetics, Renal Insufficiency, Chronic genetics
- Published
- 2022
- Full Text
- View/download PDF
17. Epigenomic and transcriptomic analyses define core cell types, genes and targetable mechanisms for kidney disease.
- Author
-
Liu H, Doke T, Guo D, Sheng X, Ma Z, Park J, Vy HMT, Nadkarni GN, Abedini A, Miao Z, Palmer M, Voight BF, Li H, Brown CD, Ritchie MD, Shu Y, and Susztak K
- Subjects
- Animals, Genome-Wide Association Study, Humans, Mice, Polymorphism, Single Nucleotide genetics, Transcriptome genetics, Epigenomics, Kidney Diseases genetics
- Abstract
More than 800 million people suffer from kidney disease, yet the mechanism of kidney dysfunction is poorly understood. In the present study, we define the genetic association with kidney function in 1.5 million individuals and identify 878 (126 new) loci. We map the genotype effect on the methylome in 443 kidneys, transcriptome in 686 samples and single-cell open chromatin in 57,229 kidney cells. Heritability analysis reveals that methylation variation explains a larger fraction of heritability than gene expression. We present a multi-stage prioritization strategy and prioritize target genes for 87% of kidney function loci. We highlight key roles of proximal tubules and metabolism in kidney function regulation. Furthermore, the causal role of SLC47A1 in kidney disease is defined in mice with genetic loss of Slc47a1 and in human individuals carrying loss-of-function variants. Our findings emphasize the key role of bulk and single-cell epigenomic information in translating genome-wide association studies into identifying causal genes, cellular origins and mechanisms of complex traits., (© 2022. The Author(s), under exclusive licence to Springer Nature America, Inc.)
- Published
- 2022
- Full Text
- View/download PDF
18. Genetic pleiotropy of ERCC6 loss-of-function and deleterious missense variants links retinal dystrophy, arrhythmia, and immunodeficiency in diverse ancestries.
- Author
-
Forrest IS, Chaudhary K, Vy HMT, Bafna S, Kim S, Won HH, Loos RJF, Cho J, Pasquale LR, Nadkarni GN, Rocheleau G, and Do R
- Subjects
- Arrhythmias, Cardiac, DNA Helicases, DNA Repair Enzymes, Exome, Humans, Poly-ADP-Ribose Binding Proteins, Exome Sequencing methods, Genetic Pleiotropy, Retinal Dystrophies genetics
- Abstract
Biobanks with exomes linked to electronic health records (EHRs) enable the study of genetic pleiotropy between rare variants and seemingly disparate diseases. We performed robust clinical phenotyping of rare, putatively deleterious variants (loss-of-function [LoF] and deleterious missense variants) in ERCC6, a gene implicated in inherited retinal disease. We analyzed 213,084 exomes, along with a targeted set of retinal, cardiac, and immune phenotypes from two large-scale EHR-linked biobanks. In the primary analysis, a burden of deleterious variants in ERCC6 was strongly associated with (1) retinal disorders; (2) cardiac and electrocardiogram perturbations; and (3) immunodeficiency and decreased immunoglobulin levels. Meta-analysis of results from the BioMe Biobank and UK Biobank showed a significant association of deleterious ERCC6 burden with retinal dystrophy (odds ratio [OR] = 2.6, 95% confidence interval [CI]: 1.5-4.6; p = 8.7 × 10
-4 ), atypical atrial flutter (OR = 3.5, 95% CI: 1.9-6.5; p = 6.2 × 10-5 ), arrhythmia (OR = 1.5, 95% CI: 1.2-2.0; p = 2.7 × 10-3 ), and lymphocyte immunodeficiency (OR = 3.8, 95% CI: 2.1-6.8; p = 5.0 × 10-6 ). Carriers of ERCC6 LoF variants who lacked a diagnosis of these conditions exhibited increased symptoms, indicating underdiagnosis. These results reveal a unique genetic link among retinal, cardiac, and immune disorders and underscore the value of EHR-linked biobanks in assessing the full clinical profile of carriers of rare variants., (© 2021 Wiley Periodicals LLC.)- Published
- 2021
- Full Text
- View/download PDF
19. Molecular Analysis of the Kidney From a Patient With COVID-19-Associated Collapsing Glomerulopathy.
- Author
-
Meliambro K, Li X, Salem F, Yi Z, Sun Z, Chan L, Chung M, Chancay J, Vy HMT, Nadkarni G, Wong JS, Fu J, Lee K, Zhang W, He JC, and Campbell KN
- Abstract
Recent case reports suggest that coronavirus disease 2019 (COVID-19) is associated with collapsing glomerulopathy in African Americans with apolipoprotein L1 gene ( APOL1 ) risk alleles; however, it is unclear whether disease pathogenesis is similar to HIV-associated nephropathy. RNA sequencing analysis of a kidney biopsy specimen from a patient with COVID-19-associated collapsing glomerulopathy and APOL1 risk alleles (G1/G1) revealed similar levels of APOL1 and angiotensin-converting enzyme 2 ( ACE2 ) messenger RNA transcripts as compared with 12 control kidney samples downloaded from the GTEx (Genotype-Tissue Expression) Portal. Whole-genome sequencing of the COVID-19-associated collapsing glomerulopathy kidney sample identified 4 indel gene variants, 3 of which are of unknown significance with respect to chronic kidney disease and/or focal segmental glomerulosclerosis. Molecular profiling of the kidney demonstrated activation of COVID-19-associated cell injury pathways such as inflammation and coagulation. Evidence for direct severe acute respiratory syndrome coronavirus 2 infection of kidney cells was lacking, which is consistent with the findings of several recent studies. Interestingly, immunostaining of kidney biopsy sections revealed increased expression of phospho-STAT3 (signal transducer and activator of transcription 3) in both COVID-19-associated collapsing glomerulopathy and HIV-associated nephropathy as compared with control kidney tissue. Importantly, interleukin 6-induced activation of STAT3 may be a targetable mechanism driving COVID-19-associated acute kidney injury., (© 2021 The Authors.)
- Published
- 2021
- Full Text
- View/download PDF
20. Genome-wide polygenic risk score for retinopathy of type 2 diabetes.
- Author
-
Forrest IS, Chaudhary K, Paranjpe I, Vy HMT, Marquez-Luna C, Rocheleau G, Saha A, Chan L, Van Vleck T, Loos RJF, Cho J, Pasquale LR, Nadkarni GN, and Do R
- Subjects
- Adult, Aged, Black People genetics, Diabetes Mellitus, Type 2 complications, Diabetes Mellitus, Type 2 genetics, Diabetes Mellitus, Type 2 pathology, Diabetic Retinopathy complications, Diabetic Retinopathy genetics, Diabetic Retinopathy pathology, Hispanic or Latino genetics, Humans, Middle Aged, Multifactorial Inheritance genetics, Risk Assessment, Risk Factors, White People genetics, Diabetes Mellitus, Type 2 epidemiology, Diabetic Retinopathy epidemiology, Genetic Predisposition to Disease, Genome-Wide Association Study
- Abstract
Diabetic retinopathy (DR) is a common consequence in type 2 diabetes (T2D) and a leading cause of blindness in working-age adults. Yet, its genetic predisposition is largely unknown. Here, we examined the polygenic architecture underlying DR by deriving and assessing a genome-wide polygenic risk score (PRS) for DR. We evaluated the PRS in 6079 individuals with T2D of European, Hispanic, African and other ancestries from a large-scale multi-ethnic biobank. Main outcomes were PRS association with DR diagnosis, symptoms and complications, and time to diagnosis, and transferability to non-European ancestries. We observed that PRS was significantly associated with DR. A standard deviation increase in PRS was accompanied by an adjusted odds ratio (OR) of 1.12 [95% confidence interval (CI) 1.04-1.20; P = 0.001] for DR diagnosis. When stratified by ancestry, PRS was associated with the highest OR in European ancestry (OR = 1.22, 95% CI 1.02-1.41; P = 0.049), followed by African (OR = 1.15, 95% CI 1.03-1.28; P = 0.028) and Hispanic ancestries (OR = 1.10, 95% CI 1.00-1.10; P = 0.050). Individuals in the top PRS decile had a 1.8-fold elevated risk for DR versus the bottom decile (P = 0.002). Among individuals without DR diagnosis, the top PRS decile had more DR symptoms than the bottom decile (P = 0.008). The PRS was associated with retinal hemorrhage (OR = 1.44, 95% CI 1.03-2.02; P = 0.03) and earlier DR presentation (10% probability of DR by 4 years in the top PRS decile versus 8 years in the bottom decile). These results establish the significant polygenic underpinnings of DR and indicate the need for more diverse ancestries in biobanks to develop multi-ancestral PRS., (© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.)
- Published
- 2021
- Full Text
- View/download PDF
21. Genetically Downregulated Interleukin-6 Signaling Is Associated With a Favorable Cardiometabolic Profile: A Phenome-Wide Association Study.
- Author
-
Georgakis MK, Malik R, Li X, Gill D, Levin MG, Vy HMT, Judy R, Ritchie M, Verma SS, Nadkarni GN, Damrauer SM, Theodoratou E, and Dichgans M
- Subjects
- Down-Regulation, Humans, Signal Transduction, Genome-Wide Association Study methods, Interleukin-6 metabolism
- Published
- 2021
- Full Text
- View/download PDF
22. Probing the aggregated effects of purifying selection per individual on 1,380 medical phenotypes in the UK Biobank.
- Author
-
Vy HMT, Jordan DM, Balick DJ, and Do R
- Subjects
- Alleles, Biological Specimen Banks, Body Mass Index, Female, Gene Frequency, Genetic Association Studies, Genetic Predisposition to Disease, Genetic Variation genetics, Humans, Male, United Kingdom, Evolution, Molecular, Genetic Fitness genetics, Genetics, Population, Selection, Genetic genetics
- Abstract
Understanding the relationship between natural selection and phenotypic variation has been a long-standing challenge in human population genetics. With the emergence of biobank-scale datasets, along with new statistical metrics to approximate strength of purifying selection at the variant level, it is now possible to correlate a proxy of individual relative fitness with a range of medical phenotypes. We calculated a per-individual deleterious load score by summing the total number of derived alleles per individual after incorporating a weight that approximates strength of purifying selection. We assessed four methods for the weight, including GERP, phyloP, CADD, and fitcons. By quantitatively tracking each of these scores with the site frequency spectrum, we identified phyloP as the most appropriate weight. The phyloP-weighted load score was then calculated across 15,129,142 variants in 335,161 individuals from the UK Biobank and tested for association on 1,380 medical phenotypes. After accounting for multiple test correction, we observed a strong association of the load score amongst coding sites only on 27 traits including body mass, adiposity and metabolic rate. We further observed that the association signals were driven by common variants (derived allele frequency > 5%) with high phyloP score (phyloP > 2). Finally, through permutation analyses, we showed that the load score amongst coding sites had an excess of nominally significant associations on many medical phenotypes. These results suggest a broad impact of deleterious load on medical phenotypes and highlight the deleterious load score as a tool to disentangle the complex relationship between natural selection and medical phenotypes., Competing Interests: I have read the journal's policy and the authors of this manuscript have the following competing interests: RD has received research support from AstraZeneca and Goldfinch Bio, being a scientific co-founder, consultant and equity holder for Pensieve Health and being a consultant for Variant Bio, all not related to this work. The other authors have declared that no competing interests exist.
- Published
- 2021
- Full Text
- View/download PDF
23. Multiple Modes of Positive Selection Shaping the Patterns of Incomplete Selective Sweeps over African Populations of Drosophila melanogaster.
- Author
-
Vy HMT, Won YJ, and Kim Y
- Subjects
- Africa, Alleles, Animals, Biological Evolution, Databases, Nucleic Acid, Evolution, Molecular, Gene Frequency genetics, Genetic Variation, Genetics, Population methods, Genome, Insect, Haplotypes genetics, Heterozygote, Models, Genetic, Mutation, Drosophila melanogaster genetics, Selection, Genetic genetics
- Abstract
It remains a challenge in evolutionary genetics to elucidate how beneficial mutations arise and propagate in a population and how selective pressures on mutant alleles are structured over space and time. By identifying "sweeping haplotypes (SHs)" that putatively carry beneficial alleles and are increasing (or have increased) rapidly in frequency, and surveying the geographic distribution of SH frequencies, we can indirectly infer how selective sweeps unfold in time and thus which modes of positive selection underlie those sweeps. Using population genomic data from African Drosophila melanogaster, we identified SHs from 37 candidate loci under selection. At more than half of loci, we identify single SHs. However, many other loci harbor multiple independent SHs, namely soft selective sweeps, either due to parallel evolution across space or a high beneficial mutation rate. At about a quarter of the loci, intermediate SH frequencies are found across multiple populations, which cannot be explained unless a certain form of frequency-dependent positive selection, such as heterozygote advantage, is invoked given the reasonable range of migration rates between African populations. At one locus, many independent SHs are observed over multiple populations but always together with ancestral haplotypes. This complex pattern is compatible with a large number of mutational targets in a gene and frequency-dependent selection on new variants. We conclude that very diverse modes of positive selection are operating at different sets of loci in D. melanogaster populations., (© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.)
- Published
- 2017
- Full Text
- View/download PDF
Catalog
Discovery Service for Jio Institute Digital Library
For full access to our library's resources, please sign in.