Back to Search
Start Over
The Sequences of 150,119 Genomes in the UK Biobank
- Source :
- Halldorsson, B V, Eggertsson, H P, Moore, K H S, Hauswedell, H, Eiriksson, O, Ulfarsson, M O, Palsson, G, Hardarson, M T, Oddsson, A, Jensson, B O, Kristmundsdottir, S, Sigurpalsdottir, B D, Stefansson, O A, Beyter, D, Holley, G, Tragante, V, Gylfason, A, Olason, P I, Zink, F, Asgeirsdottir, M, Sverrisson, S T, Sigurdsson, B, Gudjonsson, S A, Sigurdsson, G T, Halldorsson, G H, Sveinbjornsson, G, Norland, K, Styrkarsdottir, U, Magnusdottir, D N, Snorradottir, S, Kristinsson, K, Sobech, E, Jonsson, H, Geirsson, A J, Olafsson, I, Jonsson, P, Pedersen, O B, Erikstrup, C, Brunak, S, Ostrowski, S R, DBDS Genetic Consortium, Nielsen, K R, Thorleifsson, G, Jonsson, F, Melsted, P, Jonsdottir, I, Rafnar, T, Holm, H, Stefansson, H, Saemundsdottir, J, Gudbjartsson, D F, Magnusson, O T, Masson, G, Thorsteinsdottir, U, Helgason, A, Jonsson, H, Sulem, P, Stefansson, K & Nyegaard, M 2022, ' The sequences of 150,119 genomes in the UK Biobank ', Nature, vol. 607, no. 7920, pp. 732-740 . https://doi.org/10.1038/s41586-022-04965-x, Halldorsson, B V, Eggertsson, H P, Moore, K H S, Hauswedell, H, Eiriksson, O, Ulfarsson, M O, Palsson, G, Hardarson, M T, Oddsson, A, Jensson, B O, Kristmundsdottir, S, Sigurpalsdottir, B D, Stefansson, O A, Beyter, D, Holley, G, Tragante, V, Gylfason, A, Olason, P I, Zink, F, Asgeirsdottir, M, Sverrisson, S T, Sigurdsson, B, Gudjonsson, S A, Sigurdsson, G T, Halldorsson, G H, Sveinbjornsson, G, Norland, K, Styrkarsdottir, U, Magnusdottir, D N, Snorradottir, S, Kristinsson, K, Sobech, E, Jonsson, H, Geirsson, A J, Olafsson, I, Jonsson, P, Pedersen, O B, Erikstrup, C, Brunak, S, Ostrowski, S R, DBDS Genetic Consortium, Thorleifsson, G, Jonsson, F, Melsted, P, Jonsdottir, I, Rafnar, T, Holm, H, Stefansson, H, Saemundsdottir, J, Gudbjartsson, D, Magnusson, O T, Masson, G, Thorsteinsdottir, U, Helgason, A, Jonsson, H, Sulem, P, Stefansson, K, Mikkelsen, S & Nyegaard, M 2022, ' The sequences of 150,119 genomes in the UK Biobank ', Nature, vol. 607, pp. 732-740 . https://doi.org/10.1038/s41586-022-04965-x, Halldorsson, B V, Eggertsson, H P, Moore, K H S, Hauswedell, H, Eiriksson, O, Ulfarsson, M O, Palsson, G, Hardarson, M T, Oddsson, A, Jensson, B O, Kristmundsdottir, S, Sigurpalsdottir, B D, Stefansson, O A, Beyter, D, Holley, G, Tragante, V, Gylfason, A, Olason, P I, Zink, F, Asgeirsdottir, M, Sverrisson, S T, Sigurdsson, B, Gudjonsson, S A, Sigurdsson, G T, Halldorsson, G H, Sveinbjornsson, G, Norland, K, Styrkarsdottir, U, Magnusdottir, D N, Snorradottir, S, Kristinsson, K, Sobech, E, Pedersen, O B, Brunak, S, Ostrowski, S R, Banasik, K, Burgdorf, K, Didriksen, M, Hansen, T F, Hjalgrim, H, Jemec, G, Jennum, P, Johansson, P I, Ostrowski, S R, Ullum, H, Werge, T & DBDS Genetic Consortium 2022, ' The sequences of 150,119 genomes in the UK Biobank ', Nature, vol. 607 . https://doi.org/10.1038/s41586-022-04965-x
- Publication Year :
- 2022
- Publisher :
- Nature Publishing Group, 2022.
-
Abstract
- Detailed knowledge of how diversity in the sequence of the human genome affects phenotypic diversity depends on a comprehensive and reliable characterization of both sequences and phenotypic variation. Over the past decade, insights into this relationship have been obtained from whole-exome sequencing or whole-genome sequencing of large cohorts with rich phenotypic data1,2. Here we describe the analysis of whole-genome sequencing of 150,119 individuals from the UK Biobank3. This constitutes a set of high-quality variants, including 585,040,410 single-nucleotide polymorphisms, representing 7.0% of all possible human single-nucleotide polymorphisms, and 58,707,036 indels. This large set of variants allows us to characterize selection based on sequence variation within a population through a depletion rank score of windows along the genome. Depletion rank analysis shows that coding exons represent a small fraction of regions in the genome subject to strong sequence conservation. We define three cohorts within the UK Biobank: a large British Irish cohort, a smaller African cohort and a South Asian cohort. A haplotype reference panel is provided that allows reliable imputation of most variants carried by three or more sequenced individuals. We identified 895,055 structural variants and 2,536,688 microsatellites, groups of variants typically excluded from large-scale whole-genome sequencing studies. Using this formidable new resource, we provide several examples of trait associations for rare variants with large effects not found previously through studies based on whole-exome sequencing and/or imputation.
- Subjects :
- Asia/ethnology
Multidisciplinary
Whole Genome Sequencing
Haplotypes/genetics
Genetic Variation
Ireland/ethnology
Genomics
Polymorphism, Single Nucleotide/genetics
Genome-wide association studies
United Kingdom
Cohort Studies
INDEL Mutation
Africa/ethnology
Databases, Genetic
Exons/genetics
Genetics research
Genome, Human/genetics
Humans
Genetic markers
DNA sequencing
Genetic variation
Conserved Sequence
Biological Specimen Banks
Microsatellite Repeats
Subjects
Details
- Language :
- English
- ISSN :
- 14764687 and 00280836
- Volume :
- 607
- Issue :
- 7920
- Database :
- OpenAIRE
- Journal :
- Nature
- Accession number :
- edsair.doi.dedup.....eca66958be3b25bece9bbe2a66f555d8
- Full Text :
- https://doi.org/10.1038/s41586-022-04965-x