Back to Search Start Over

Whole exome sequencing and characterization of coding variation in 49,960 individuals in the UK Biobank

Authors :
Ida Surakka
Alexander Lopez
Lukas Habegger
Shareef Khalid
William J Salerno
Andrew Blumenfeld
Bin Ye
Jeffrey G. Reid
Ashutosh K. Pandey
Alicia Hawes
John D. Overton
Giovanni Coppola
Jonathan Marchini
Wendy K. Chung
David J. Carey
David H. Ledbetter
Kristian Hveem
Aris N. Economides
Cristopher V. Van Hout
Kavita Praveen
Cristen J. Willer
Joshua D. Backman
Anthony Marcketta
Ioanna Tachmazidou
Marcus B. Jones
O’Dushlaine Colm
John Penn
Joseph B. Leader
Leland Barnard
Daren Liu
George D. Yancopoulos
Michael N. Cantor
Suganthi Balasubramanian
Laura M. Yerges-Armstrong
Alan R. Shuldiner
Claudia Gonzaga-Jauregui
John C. Whittaker
Nilanjana Banerjee
Aris Baras
Gonçalo R. Abecasis
Claudia Schurmann
Robert A. Scott
Evan Maxwell
Matthew R. Nelson
Jeffrey Staples
Ashish Yadav
Joshua D. Hoffman
Lon R. Cardon
Alexander H. Li
Publication Year :
2019
Publisher :
Cold Spring Harbor Laboratory, 2019.

Abstract

SUMMARYThe UK Biobank is a prospective study of 502,543 individuals, combining extensive phenotypic and genotypic data with streamlined access for researchers around the world. Here we describe the first tranche of large-scale exome sequence data for 49,960 study participants, revealing approximately 4 million coding variants (of which ~98.4% have frequency < 1%). The data includes 231,631 predicted loss of function variants, a >10-fold increase compared to imputed sequence for the same participants. Nearly all genes (>97%) had ≥1 predicted loss of function carrier, and most genes (>69%) had ≥10 loss of function carriers. We illustrate the power of characterizing loss of function variation in this large population through association analyses across 1,741 phenotypes. In addition to replicating a range of established associations, we discover novel loss of function variants with large effects on disease traits, including PIEZO1 on varicose veins, COL6A1 on corneal resistance, MEPE on bone density, and IQGAP2 and GMPR on blood cell traits. We further demonstrate the value of exome sequencing by surveying the prevalence of pathogenic variants of clinical significance in this population, finding that 2% of the population has a medically actionable variant. Additionally, we leverage the phenotypic data to characterize the relationship between rare BRCA1 and BRCA2 pathogenic variants and cancer risk. Exomes from the first 49,960 participants are now made accessible to the scientific community and highlight the promise offered by genomic sequencing in large-scale population-based studies.

Details

Database :
OpenAIRE
Accession number :
edsair.doi.dedup.....ed87f8ff1c2cbdc9569dbaa5b001bd62
Full Text :
https://doi.org/10.1101/572347