Back to Search Start Over

A fast algorithm for genome-wide haplotype pattern mining.

Authors :
Besenbacher, Søren
Pedersen, Christian N. S.
Mailund, Thomas
Source :
BMC Bioinformatics. 2009 Supplement 1, Vol. 10, Special section p1-8. 8p. 1 Diagram, 2 Charts, 5 Graphs.
Publication Year :
2009

Abstract

Background: Identifying the genetic components of common diseases has long been an important area of research. Recently, genotyping technology has reached the level where it is cost effective to genotype single nucleotide polymorphism (SNP) markers covering the entire genome, in thousands of individuals, and analyse such data for markers associated with a diseases. The statistical power to detect association, however, is limited when markers are analysed one at a time. This can be alleviated by considering multiple markers simultaneously. The Haplotype Pattern Mining (HPM) method is a machine learning approach to do exactly this. Results: We present a new, faster algorithm for the HPM method. The new approach use patterns of haplotype diversity in the genome: locally in the genome, the number of observed haplotypes is much smaller than the total number of possible haplotypes. We show that the new approach speeds up the HPM method with a factor of 2 on a genome-wide dataset with 5009 individuals typed in 491208 markers using default parameters and more if the pattern length is increased. Conclusion: The new algorithm speeds up the HPM method and we show that it is feasible to apply HPM to whole genome association mapping with thousands of individuals and hundreds of thousands of markers. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14712105
Volume :
10
Database :
Academic Search Index
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
54372956
Full Text :
https://doi.org/10.1186/1471-2105-10-S1-S74