Back to Search
Start Over
Automatic annotation of spatial expression patterns via sparse Bayesian factor models.
- Source :
-
PLoS computational biology [PLoS Comput Biol] 2011 Jul; Vol. 7 (7), pp. e1002098. Date of Electronic Publication: 2011 Jul 21. - Publication Year :
- 2011
-
Abstract
- Advances in reporters for gene expression have made it possible to document and quantify expression patterns in 2D-4D. In contrast to microarrays, which provide data for many genes but averaged and/or at low resolution, images reveal the high spatial dynamics of gene expression. Developing computational methods to compare, annotate, and model gene expression based on images is imperative, considering that available data are rapidly increasing. We have developed a sparse Bayesian factor analysis model in which the observed expression diversity of among a large set of high-dimensional images is modeled by a small number of hidden common factors. We apply this approach on embryonic expression patterns from a Drosophila RNA in situ image database, and show that the automatically inferred factors provide for a meaningful decomposition and represent common co-regulation or biological functions. The low-dimensional set of factor mixing weights is further used as features by a classifier to annotate expression patterns with functional categories. On human-curated annotations, our sparse approach reaches similar or better classification of expression patterns at different developmental stages, when compared to other automatic image annotation methods using thousands of hard-to-interpret features. Our study therefore outlines a general framework for large microscopy data sets, in which both the generative model itself, as well as its application for analysis tasks such as automated annotation, can provide insight into biological questions.
- Subjects :
- Algorithms
Animals
Area Under Curve
Artificial Intelligence
Cluster Analysis
Drosophila melanogaster genetics
Drosophila melanogaster metabolism
Gene Expression Regulation, Developmental
Humans
Models, Biological
Oligonucleotide Array Sequence Analysis
Bayes Theorem
Computational Biology methods
Gene Expression Profiling methods
Image Processing, Computer-Assisted methods
Pattern Recognition, Automated methods
Subjects
Details
- Language :
- English
- ISSN :
- 1553-7358
- Volume :
- 7
- Issue :
- 7
- Database :
- MEDLINE
- Journal :
- PLoS computational biology
- Publication Type :
- Academic Journal
- Accession number :
- 21814502
- Full Text :
- https://doi.org/10.1371/journal.pcbi.1002098