1. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects
- Author
-
Kellen Riall, Brian Charlesworth, Jeffrey D. Jensen, Laurent Excoffier, Hannes Becher, Parul Johri, and Von Haeseler, Arndt
- Subjects
Population ,Inference ,Biology ,AcademicSubjects/SCI01180 ,Polymorphism, Single Nucleotide ,Article ,Negative selection ,Genome Size ,Statistics ,Genetics ,Methods ,fastsimcoal2 ,Population growth ,Nuisance parameter ,Selection, Genetic ,education ,Molecular Biology ,Ecology, Evolution, Behavior and Systematics ,Selection (genetic algorithm) ,Mathematics ,Demography ,education.field_of_study ,Models, Genetic ,Estimation theory ,approximate Bayesian computation (ABC) ,Population size ,AcademicSubjects/SCI01130 ,Bayes Theorem ,Background selection ,demographic inference ,Markov Chains ,background selection ,Genetic Techniques ,MSMC ,570 Life sciences ,biology ,Genetic Fitness ,distribution of fitness effects - Abstract
Current procedures for inferring population history generally assume complete neutrality���that is, they neglect both direct selection and the effects of selection on linked sites.We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically.We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy. Key words: demographic inference, background selection, distribution of fitness effects, MSMC, fastsimcoal2, approximate Bayesian computation (ABC).
- Published
- 2021