Back to Search Start Over

Simultaneous estimation of genotype error and uncalled deletion rates in whole genome sequence data.

Authors :
Masaki N
Browning SR
Browning BL
Source :
PLoS genetics [PLoS Genet] 2024 May 24; Vol. 20 (5), pp. e1011297. Date of Electronic Publication: 2024 May 24 (Print Publication: 2024).
Publication Year :
2024

Abstract

Genotype data include errors that may influence conclusions reached by downstream statistical analyses. Previous studies have estimated genotype error rates from discrepancies in human pedigree data, such as Mendelian inconsistent genotypes or apparent phase violations. However, uncalled deletions, which generally have not been accounted for in these studies, can lead to biased error rate estimates. In this study, we propose a genotype error model that considers both genotype errors and uncalled deletions when calculating the likelihood of the observed genotypes in parent-offspring trios. Using simulations, we show that when there are uncalled deletions, our model produces genotype error rate estimates that are less biased than estimates from a model that does not account for these deletions. We applied our model to SNVs in 77 sequenced White British parent-offspring trios in the UK Biobank. We use the Akaike information criterion to show that our model fits the data better than a model that does not account for uncalled deletions. We estimate the genotype error rate at SNVs with minor allele frequency > 0.001 in these data to be [Formula: see text]. We estimate that 77% of the genotype errors at these markers are attributable to uncalled deletions [Formula: see text].<br />Competing Interests: The authors have declared that no competing interests exist.<br /> (Copyright: © 2024 Masaki et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)

Details

Language :
English
ISSN :
1553-7404
Volume :
20
Issue :
5
Database :
MEDLINE
Journal :
PLoS genetics
Publication Type :
Academic Journal
Accession number :
38787916
Full Text :
https://doi.org/10.1371/journal.pgen.1011297