1. A Continuous Statistical Phasing Framework for the Analysis of Forensic Mitochondrial DNA Mixtures
- Author
-
Bruce Budowle, Melissa Muenzler, August E. Woerner, Utpal Smart, Sammed N. Mandape, Jonathan L. King, and Jennifer Churchill Cihlar
- Subjects
0301 basic medicine ,Forensic Genetics ,lcsh:QH426-470 ,population genomics ,In silico ,Population ,Bayesian inference ,Ion Torrent ,DNA, Mitochondrial ,Polymorphism, Single Nucleotide ,Haplogroup ,Article ,03 medical and health sciences ,DEploid ,0302 clinical medicine ,mtDNA mixture deconvolution ,Genetics ,Humans ,030216 legal & forensic medicine ,education ,Genetics (clinical) ,Mathematics ,education.field_of_study ,Massive parallel sequencing ,Models, Statistical ,bayesian inference ,massively parallel sequencing ,Computational Biology ,High-Throughput Nucleotide Sequencing ,Reproducibility of Results ,Bayes Theorem ,Ion semiconductor sequencing ,bioinformatics ,Genomics ,Sequence Analysis, DNA ,lcsh:Genetics ,030104 developmental biology ,computational phasing ,Genome, Mitochondrial ,Deconvolution ,Biological system ,Haplotype estimation ,Algorithms - Abstract
Despite the benefits of quantitative data generated by massively parallel sequencing, resolving mitotypes from mixtures occurring in certain ratios remains challenging. In this study, a bioinformatic mixture deconvolution method centered on population-based phasing was developed and validated. The method was first tested on 270 in silico two-person mixtures varying in mixture proportions. An assortment of external reference panels containing information on haplotypic variation (from similar and different haplogroups) was leveraged to assess the effect of panel composition on phasing accuracy. Building on these simulations, mitochondrial genomes from the Human Mitochondrial DataBase were sourced to populate the panels and key parameter values were identified by deconvolving an additional 7290 in silico two-person mixtures. Finally, employing an optimized reference panel and phasing parameters, the approach was validated with in vitro two-person mixtures with differing proportions. Deconvolution was most accurate when the haplotypes in the mixture were similar to haplotypes present in the reference panel and when the mixture ratios were neither highly imbalanced nor subequal (e.g., 4:1). Overall, errors in haplotype estimation were largely bounded by the accuracy of the mixture’s genotype results. The proposed framework is the first available approach that automates the reconstruction of complete individual mitotypes from mixtures, even in ratios that have traditionally been considered problematic.
- Published
- 2021