Back to Search Start Over

Low-coverage sequencing cost-effectively detects known and novel variation in underrepresented populations

Authors :
NeuroGAP-Psychosis Study Team
Roxanne James
Carter P. Newman
Rocky E. Stroud
Abebaw Fekadu
Joseph Kyebuzibwa
Sheila Dodge
Elizabeth G. Atkinson
Lerato Majara
Dan J. Stein
Zukiswa Zingela
Wilfred E. Injera
Lori B. Chibnik
Mark J. Daly
Raj Ramesar
Karestan C. Koenen
Anne Stevenson
Tamrat Abebe
Solomon Teferra
Gabriel Kigen
Timothy DeSmet
Symon M. Kariuki
Joseph K. Pickrell
Stella Gichuru
Melkam Alemayehu
Fred K. Ashaba
Welelta Shiferaw
Lukoye Atwoli
Dickens Akena
Edith Kwobah
Charles R. Newton
Tera Bowers
Steven Ferriera
Rehema M. Mwema
Benjamin M. Neale
Bizu Gelaye
Alicia R. Martin
Henry Musinguzi
Celia van der Merwe
Sinéad B. Chapman
Team, NeuroGAP-Psychosis Study
Source :
Am J Hum Genet
Publication Year :
2020

Abstract

BackgroundGenetic studies of biomedical phenotypes in underrepresented populations identify disproportionate numbers of novel associations. However, current genomics infrastructure--including most genotyping arrays and sequenced reference panels--best serves populations of European descent. A critical step for facilitating genetic studies in underrepresented populations is to ensure that genetic technologies accurately capture variation in all populations. Here, we quantify the accuracy of low-coverage sequencing in diverse African populations.ResultsWe sequenced the whole genomes of 91 individuals to high-coverage (≥20X) from the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study, in which participants were recruited from Ethiopia, Kenya, South Africa, and Uganda. We empirically tested two data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole genome sequencing data. We show that low-coverage sequencing at a depth of ≥4X captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1X) performed comparable to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation, with 4X sequencing detecting 45% of singletons and 95% of common variants identified in high-coverage African whole genomes.ConclusionThese results indicate that low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, including those that capture variation most common in Europeans and Africans. Low-coverage sequencing effectively identifies novel variation (particularly in underrepresented populations), and presents opportunities to enhance variant discovery at a similar cost to traditional approaches.

Details

ISSN :
15376605
Volume :
108
Issue :
4
Database :
OpenAIRE
Journal :
American journal of human genetics
Accession number :
edsair.doi.dedup.....333c6c8cce23eb2d2e8cd884f6b1d7cd