Back to Search Start Over

Leveraging base-pair mammalian constraint to understand genetic variation and human disease

Authors :
Swedish Research Council
Knut and Alice Wallenberg Foundation
Swedish Cancer Society
Swedish Childhood Cancer Foundation
National Institute of Mental Health (US)
Gladstone Institutes
National Institute on Drug Abuse (US)
University College Dublin
National Human Genome Research Institute (US)
National Institutes of Health (US)
National Health and Medical Research Council (Australia)
Juan, David [0000-0003-1912-9667]
Marqués-Bonet, Tomàs [0000-0002-5597-3075]
Muntané, Gerard [0000-0003-1541-8365]
Navarro, Arcadi [0000-0003-2162-8246]
Valenzuela, Alejandro [0000-0001-6120-6246]
Sullivan, Patrick F.
Meadows, Jennifer R. S.
Gazal, Steven
Phan, BaDoi N.
Li, Xue
Genereux, Diane P.
Dong, Michael X.
Bianchi, Matteo
Andrews, Gregory
Sakthikumar, Sharadha
Nordin, Jessika
Roy, Ananya
Christmas, Matthew J.
Marinescu, Voichita D.
Wang, Chao
Wallerman, Ola
Xue, James
Yao, Shuyang
Sun, Quan
Szatkiewicz, Jin
Wen, Jia
Huckins, Laura M.
Lawler, Alyssa
Keough, Kathleen C.
Zheng, Zhili
Zeng, Jian
Wray, Naomi R.
Li, Yun
Johnson, Jessica
Chen, Jiawen
Zoonomia Consortium
Juan, David
Marqués-Bonet, Tomàs
Muntané, Gerard
Navarro, Arcadi
Serres-Armero, Aitor
Valenzuela, Alejandro
Paten, Benedict
Reilly, Steven K.
Hughes, Graham M.
Weng, Zhiping
Pollard, Katherine S.
Pfenning, Andreas R.
Forsberg-Nilsson, Karin
Karlsson, Elinor K.
Lindblad-Toh, Kerstin
Swedish Research Council
Knut and Alice Wallenberg Foundation
Swedish Cancer Society
Swedish Childhood Cancer Foundation
National Institute of Mental Health (US)
Gladstone Institutes
National Institute on Drug Abuse (US)
University College Dublin
National Human Genome Research Institute (US)
National Institutes of Health (US)
National Health and Medical Research Council (Australia)
Juan, David [0000-0003-1912-9667]
Marqués-Bonet, Tomàs [0000-0002-5597-3075]
Muntané, Gerard [0000-0003-1541-8365]
Navarro, Arcadi [0000-0003-2162-8246]
Valenzuela, Alejandro [0000-0001-6120-6246]
Sullivan, Patrick F.
Meadows, Jennifer R. S.
Gazal, Steven
Phan, BaDoi N.
Li, Xue
Genereux, Diane P.
Dong, Michael X.
Bianchi, Matteo
Andrews, Gregory
Sakthikumar, Sharadha
Nordin, Jessika
Roy, Ananya
Christmas, Matthew J.
Marinescu, Voichita D.
Wang, Chao
Wallerman, Ola
Xue, James
Yao, Shuyang
Sun, Quan
Szatkiewicz, Jin
Wen, Jia
Huckins, Laura M.
Lawler, Alyssa
Keough, Kathleen C.
Zheng, Zhili
Zeng, Jian
Wray, Naomi R.
Li, Yun
Johnson, Jessica
Chen, Jiawen
Zoonomia Consortium
Juan, David
Marqués-Bonet, Tomàs
Muntané, Gerard
Navarro, Arcadi
Serres-Armero, Aitor
Valenzuela, Alejandro
Paten, Benedict
Reilly, Steven K.
Hughes, Graham M.
Weng, Zhiping
Pollard, Katherine S.
Pfenning, Andreas R.
Forsberg-Nilsson, Karin
Karlsson, Elinor K.
Lindblad-Toh, Kerstin
Publication Year :
2023

Abstract

[INTRODUCTION] Thousands of genetic variants have been associated with human diseases and traits through genome-wide association studies (GWASs). Translating these discoveries into improved therapeutics requires discerning which variants among hundreds of candidates are causally related to disease risk. To date, only a handful of causal variants have been confirmed. Here, we leverage 100 million years of mammalian evolution to address this major challenge.<br />[RATIONALE] We compared genomes from hundreds of mammals and identified bases with unusually few variants (evolutionarily constrained). Constraint is a measure of functional importance that is agnostic to cell type or developmental stage. It can be applied to investigate any heritable disease or trait and is complementary to resources using cell type– and time point–specific functional assays like Encyclopedia of DNA Elements (ENCODE) and Genotype-Tissue Expression (GTEx).<br />[RESULTS] Using constraint calculated across placental mammals, 3.3% of bases in the human genome are significantly constrained, including 57.6% of coding bases. Most constrained bases (80.7%) are noncoding. Common variants (allele frequency ≥ 5%) and low-frequency variants (0.5% ≤ allele frequency < 5%) are depleted for constrained bases (1.85 versus 3.26% expected by chance, P < 2.2 × 10−308). Pathogenic ClinVar variants are more constrained than benign variants (P < 2.2 × 10−16). The most constrained common variants are more enriched for disease single-nucleotide polymorphism (SNP)–heritability in 63 independent GWASs. The enrichment of SNP-heritability in constrained regions is greater (7.8-fold) than previously reported in mammals and is even higher in primates (11.1-fold). It exceeds the enrichment of SNP-heritability in nonsynonymous coding variants (7.2-fold) and fine-mapped expression quantitative trait loci (eQTL)–SNPs (4.8-fold). The enrichment peaks near constrained bases, with a log-linear decrease of SNP-heritability enrichment as a function of the distance to a constrained base. Zoonomia constraint scores improve functionally informed fine-mapping. Variants at sites constrained in mammals and primates have greater posterior inclusion probabilities and higher per-SNP contributions. In addition, using both constraint and functional annotations improves polygenic risk score accuracy across a range of traits. Finally, incorporating constraint information into the analysis of noncoding somatic variants in medulloblastomas identifies new candidate driver genes.<br />[CONCLUSION] Genome-wide measures of evolutionary constraint can help discern which variants are functionally important. This information may accelerate the translation of genomic discoveries into the biological, clinical, and therapeutic knowledge that is required to understand and treat human disease.

Details

Database :
OAIster
Notes :
English
Publication Type :
Electronic Resource
Accession number :
edsoai.on1431963244
Document Type :
Electronic Resource