1. An efficient multi-locus mixed-model approach for genome-wide association studies in structured populations
- Author
-
Magnus Nordborg, Bjarni J. Vilhjálmsson, Alexander Platt, Ümit Seren, Quan Long, Vincent Segura, Arthur Korte, Unité de recherche Amélioration, Génétique et Physiologie Forestières (AGPF), Institut National de la Recherche Agronomique (INRA), Austrian Academy of Sciences (OeAW), Gregor Mendel Institute of Molecular Plant Biology (GMI), Department of Molecular and Computational Biology, University of Southern California (USC), National Heart, Lung, and Blood Institute (NHLBI), Ecologie des Forets, Prairies et milieux Aquatiques (EFPA) department of INRA, Deutsche Forschungsgemeinschaft (DFG), US National Institutes of Health [P50 HG002790], European Union [283496], Austrian Academy of Sciences through GMI, Unité de recherche Amélioration, Génétique et Physiologie Forestières (UAGPF), Segura, Vincent, and Vilhjálmsson, Bjarni J
- Subjects
0106 biological sciences ,Mixed model ,False discovery rate ,Linkage disequilibrium ,Genotype ,Quantitative Trait Loci ,Arabidopsis ,Genome-wide association study ,Locus (genetics) ,polymorphisme ,Computational biology ,Molecular Dynamics Simulation ,Biology ,mlst ,Polymorphism, Single Nucleotide ,01 natural sciences ,Article ,Linkage Disequilibrium ,apparentement ,03 medical and health sciences ,Bayes' theorem ,stratification ,génétique d'association ,Population Groups ,Genetics ,sélection ,Humans ,[SDV.BV]Life Sciences [q-bio]/Vegetal Biology ,030304 developmental biology ,Genetic association ,modèle mixte ,0303 health sciences ,Vegetal Biology ,Models, Genetic ,Genome, Human ,qtl ,Chromosome Mapping ,Bayes Theorem ,variation génétique ,régression ,Genetic Loci ,Allelic heterogeneity ,étude d'association ,Biologie végétale ,Genome, Plant ,Genome-Wide Association Study ,010606 plant biology & botany - Abstract
Population structure causes genome-wide linkage disequilibrium between unlinked loci, leading to statistical confounding in genome-wide association studies. Mixed models have been shown to handle the confounding effects of a diffuse background of large numbers of loci of small effect well, but do not always account for loci of larger effect. Here we propose a multi-locus mixed model as a general method for mapping complex traits in structured populations. Simulations suggest that our method outperforms existing methods, in terms of power as well as false discovery rate. We apply our method to human and Arabidopsis thaliana data, identifying novel associations in known candidates as well as evidence for allelic heterogeneity. We also demonstrate how a priori knowledge from an A. thaliana linkage mapping study can be integrated into our method using a Bayesian approach. Our implementation is computationally efficient, making the analysis of large datasets (n > 10000) practicable.
- Published
- 2012
- Full Text
- View/download PDF