Back to Search
Start Over
Robust relationship inference in genome-wide association studies
- Source :
- Bioinformatics (Oxford, England). 26(22)
- Publication Year :
- 2010
-
Abstract
- Motivation: Genome-wide association studies (GWASs) have been widely used to map loci contributing to variation in complex traits and risk of diseases in humans. Accurate specification of familial relationships is crucial for family-based GWAS, as well as in population-based GWAS with unknown (or unrecognized) family structure. The family structure in a GWAS should be routinely investigated using the SNP data prior to the analysis of population structure or phenotype. Existing algorithms for relationship inference have a major weakness of estimating allele frequencies at each SNP from the entire sample, under a strong assumption of homogeneous population structure. This assumption is often untenable. Results: Here, we present a rapid algorithm for relationship inference using high-throughput genotype data typical of GWAS that allows the presence of unknown population substructure. The relationship of any pair of individuals can be precisely inferred by robust estimation of their kinship coefficient, independent of sample composition or population structure (sample invariance). We present simulation experiments to demonstrate that the algorithm has sufficient power to provide reliable inference on millions of unrelated pairs and thousands of relative pairs (up to 3rd-degree relationships). Application of our robust algorithm to HapMap and GWAS datasets demonstrates that it performs properly even under extreme population stratification, while algorithms assuming a homogeneous population give systematically biased results. Our extremely efficient implementation performs relationship inference on millions of pairs of individuals in a matter of minutes, dozens of times faster than the most efficient existing algorithm known to us. Availability: Our robust relationship inference algorithm is implemented in a freely available software package, KING, available for download at http://people.virginia.edu/∼wc9c/KING. Contact: wmchen@virginia.edu Supplementary information: Supplementary data are available at Bioinformatics online.
- Subjects :
- Statistics and Probability
Genotype
Population
Inference
Genome-wide association study
Sample (statistics)
Biology
Population stratification
computer.software_genre
Biochemistry
Polymorphism, Single Nucleotide
Population Groups
Humans
International HapMap Project
education
Molecular Biology
Allele frequency
Genetic association
Genetics
education.field_of_study
Genome, Human
Original Papers
Computer Science Applications
Computational Mathematics
Phenotype
Computational Theory and Mathematics
Data mining
computer
Algorithms
Genome-Wide Association Study
Subjects
Details
- ISSN :
- 13674811
- Volume :
- 26
- Issue :
- 22
- Database :
- OpenAIRE
- Journal :
- Bioinformatics (Oxford, England)
- Accession number :
- edsair.doi.dedup.....734c99e36ccb236ecd6bf6846ec0425b