1. Rare coding variant analysis for human diseases across biobanks and ancestries.
- Author
-
Jurgens SJ, Wang X, Choi SH, Weng LC, Koyama S, Pirruccello JP, Nguyen T, Smadbeck P, Jang D, Chaffin M, Walsh R, Roselli C, Elliott AL, Wijdeveld LFJM, Biddinger KJ, Kany S, Rämö JT, Natarajan P, Aragam KG, Flannick J, Burtt NP, Bezzina CR, Lubitz SA, Lunetta KL, and Ellinor PT
- Subjects
- Humans, Genetic Variation, Genetic Predisposition to Disease, White People genetics, Disease genetics, Genome-Wide Association Study, Biological Specimen Banks
- Abstract
Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models, we performed gene-based rare variant testing for 601 diseases across 748,879 individuals, including 155,236 with ancestry dissimilar to European. We identified 363 significant associations, which highlighted core genes for the human disease phenome and identified potential novel associations, including UBR3 for cardiometabolic disease and YLPM1 for psychiatric disease. Pan-ancestry burden testing represented an inclusive and useful approach for discovery in diverse datasets, although we also highlight the importance of ancestry-specific sensitivity analyses in this setting. Finally, we found that effect sizes for rare protein-disrupting variants were concordant between samples similar to European ancestry and other genetic ancestries (β
Deming = 0.7-1.0). Our results have implications for multi-ancestry and cross-biobank approaches in sequencing association studies for human disease., (© 2024. The Author(s), under exclusive licence to Springer Nature America, Inc.)- Published
- 2024
- Full Text
- View/download PDF