Back to Search Start Over

Reconstituting the frequency spectrum of ascertained single-nucleotide polymorphism data.

Authors :
Nielsen R
Hubisz MJ
Clark AG
Source :
Genetics [Genetics] 2004 Dec; Vol. 168 (4), pp. 2373-82. Date of Electronic Publication: 2004 Sep 15.
Publication Year :
2004

Abstract

Most of the available SNP data have eluded valid population genetic analysis because most population genetical methods do not correctly accommodate the special discovery process used to identify SNPs. Most of the available SNP data have allele frequency distributions that are biased by the ascertainment protocol. We here show how this problem can be corrected by obtaining maximum-likelihood estimates of the true allele frequency distribution. In simple cases, the ML estimate of the true allele frequency distribution can be obtained analytically, but in other cases computational methods based on numerical optimization or the EM algorithm must be used. We illustrate the new correction method by analyzing some previously published SNP data from the SNP Consortium. Appropriate treatment of SNP ascertainment is vital to our ability to make correct inferences from the data of the International HapMap Project.

Details

Language :
English
ISSN :
0016-6731
Volume :
168
Issue :
4
Database :
MEDLINE
Journal :
Genetics
Publication Type :
Academic Journal
Accession number :
15371362
Full Text :
https://doi.org/10.1534/genetics.104.031039