GenXHC: a probabilistic generative model for cross-hybridization compensation in high-density genome-wide microarray data

Authors :
Huang, Jim C.
Morris, Quaid D.
Hughes, Timothy R.
Frey, Brendan J.
Source :
Bioinformatics; June 2005, Vol. 21 Issue: Supplement 1 p i222-i222, 1p
Publication Year :
2005

Abstract

Motivation: Microarray designs containing millions to hundreds of millions of probes that tile entire genomes are currently being released. Within the next 2 months, our group will release a microarray data set containing over 12 000 000 microarray measurements taken from 37 mouse tissues. A problem that will become increasingly significant in the upcoming era of genome-wide exon-tiling microarray experiments is the removal of cross-hybridization noise. We present a probabilistic generative model for cross-hybridization in microarray data and a corresponding variational learning method for cross-hybridization compensation, GenXHC, that reduces cross-hybridization noise by taking into account multiple sources for each mRNA expression level measurement, as well as prior knowledge of hybridization similarities between the nucleotide sequences of microarray probes and their target cDNAs. Results: The algorithm is applied to a subset of an exon-resolution genome-wide Agilent microarray data set for chromosome 16 of Mus musculus and is found to produce statistically significant reductions in cross-hybridization noise. The denoised data is found to produce enrichment in multiple gene ontology–biological process (GO–BP) functional groups. The algorithm is found to outperform robust multi-array analysis, another method for cross-hybridization compensation. Contact: jim@psi.toronto.edu

Details

Language :
English
ISSN :
13674803 and 13674811
Volume :
21
Issue :
Supplement 1
Database :
Supplemental Index
Journal :
Bioinformatics
Publication Type :
Periodical
Accession number :
ejs7380642
Full Text :
https://doi.org/10.1093/bioinformatics/bti1045