Start Over

Detecting selection in population trees: the Lewontin and Krakauer test extended

Authors :: Sarah C. Blott
Simon Boitard
Bertrand Servin
Magali SanCristobal
Jihad Abdallah
Maxime Bonhomme
Claude Chevalet
Laboratoire de Génétique Cellulaire (LGC)
Ecole Nationale Vétérinaire de Toulouse (ENVT)
Institut National Polytechnique (Toulouse) (Toulouse INP)
Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National Polytechnique (Toulouse) (Toulouse INP)
Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées-Institut National de la Recherche Agronomique (INRA)
Institut National de la Recherche Agronomique (INRA)-Ecole Nationale Vétérinaire de Toulouse (ENVT)
Université Fédérale Toulouse Midi-Pyrénées-Université Fédérale Toulouse Midi-Pyrénées
Animal Health Trust (AHT)
Source :: Genetics, Genetics, Genetics Society of America, 2010, 186 (1), pp.241-62. ⟨10.1534/genetics.104.117275⟩, Genetics, Genetics Society of America, 2010, 186 (1), pp.241-262. ⟨10.1534/genetics.110.117275⟩
Publication Year :: 2010
Publisher :: HAL CCSD, 2010.
Abstract: Detecting genetic signatures of selection is of great interest for many research issues. Common approaches to separate selective from neutral processes focus on the variance of FST across loci, as does the original Lewontin and Krakauer (LK) test. Modern developments aim to minimize the false positive rate and to increase the power, by accounting for complex demographic structures. Another stimulating goal is to develop straightforward parametric and computationally tractable tests to deal with massive SNP data sets. Here, we propose an extension of the original LK statistic (TLK), named TF–LK, that uses a phylogenetic estimation of the population's kinship (\documentclass[10pt]{article} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{pmc} \usepackage[Euler]{upgreek} \pagestyle{empty} \oddsidemargin -1.0in \begin{document} \begin{equation*}{\mathrm{{\mathscr{F}}}}\end{equation*}\end{document}) matrix, thus accounting for historical branching and heterogeneity of genetic drift. Using forward simulations of single-nucleotide polymorphisms (SNPs) data under neutrality and selection, we confirm the relative robustness of the LK statistic (TLK) to complex demographic history but we show that TF–LK is more powerful in most cases. This new statistic outperforms also a multinomial-Dirichlet-based model [estimation with Markov chain Monte Carlo (MCMC)], when historical branching occurs. Overall, TF–LK detects 15–35% more selected SNPs than TLK for low type I errors (P < 0.001). Also, simulations show that TLK and TF–LK follow a chi-square distribution provided the ancestral allele frequencies are not too extreme, suggesting the possible use of the chi-square distribution for evaluating significance. The empirical distribution of TF–LK can be derived using simulations conditioned on the estimated \documentclass[10pt]{article} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{pmc} \usepackage[Euler]{upgreek} \pagestyle{empty} \oddsidemargin -1.0in \begin{document} \begin{equation*}{\mathrm{{\mathscr{F}}}}\end{equation*}\end{document} matrix. We apply this new test to pig breeds SNP data and pinpoint outliers using TF–LK, otherwise undetected using the less powerful TLK statistic. This new test represents one solution for compromise between advanced SNP genetic data acquisition and outlier analyses.

Subjects :: 0106 biological sciences
Genetic Markers
Swine
[SDV]Life Sciences [q-bio]
MESH: Selection, Genetic
Population
MESH: Genetics, Population
Biology
Investigations
MESH: Genetic Markers
010603 evolutionary biology
01 natural sciences
Polymorphism, Single Nucleotide
Evolution, Molecular
03 medical and health sciences
symbols.namesake
Genetic drift
Genetics
Animals
MESH: Animals
MESH: Models, Genetic
Selection, Genetic
education
MESH: Genetic Drift
MESH: Phylogeny
MESH: Swine
Statistic
Phylogeny
MESH: Evolution, Molecular
030304 developmental biology
Parametric statistics
0303 health sciences
education.field_of_study
[STAT.AP]Statistics [stat]/Applications [stat.AP]
[SDV.GEN.GPO]Life Sciences [q-bio]/Genetics/Populations and Evolution [q-bio.PE]
Models, Genetic
MESH: Polymorphism, Single Nucleotide
Genetic Drift
Markov chain Monte Carlo
Empirical distribution function
Genetics, Population
Outlier
symbols
False positive rate

Details

Language :: English
ISSN :: 00166731
Database :: OpenAIRE
Journal :: Genetics, Genetics, Genetics Society of America, 2010, 186 (1), pp.241-62. ⟨10.1534/genetics.104.117275⟩, Genetics, Genetics Society of America, 2010, 186 (1), pp.241-262. ⟨10.1534/genetics.110.117275⟩
Accession number :: edsair.doi.dedup.....ee93f8429d0bcf99eb33483838330bad

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Detecting selection in population trees: the Lewontin and Krakauer test extended

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Detecting selection in population trees: the Lewontin and Krakauer test extended

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources