Back to Search
Start Over
A Bayesian framework for combining heterogeneous data sources for gene function prediction (in Saccharomyces cerevisiae )
- Source :
- Proceedings of the National Academy of Sciences. 100:8348-8353
- Publication Year :
- 2003
- Publisher :
- Proceedings of the National Academy of Sciences, 2003.
-
Abstract
- Genomic sequencing is no longer a novelty, but gene function annotation remains a key challenge in modern biology. A variety of functional genomics experimental techniques are available, from classic methods such as affinity precipitation to advanced high-throughput techniques such as gene expression microarrays. In the future, more disparate methods will be developed, further increasing the need for integrated computational analysis of data generated by these studies. We address this problem with magic (Multisource Association of Genes by Integration of Clusters), a general framework that uses formal Bayesian reasoning to integrate heterogeneous types of high-throughput biological data (such as large-scale two-hybrid screens and multiple microarray analyses) for accurate gene function prediction. The system formally incorporates expert knowledge about relative accuracies of data sources to combine them within a normative framework. magic provides a belief level with its output that allows the user to vary the stringency of predictions. We applied magic to Saccharomyces cerevisiae genetic and physical interactions, microarray, and transcription factor binding sites data and assessed the biological relevance of gene groupings using Gene Ontology annotations produced by the Saccaromyces Genome Database. We found that by creating functional groupings based on heterogeneous data types, magic improved accuracy of the groupings compared with microarray analysis alone. We describe several of the biological gene groupings identified.
- Subjects :
- Genetics
Biological data
Binding Sites
Saccharomyces cerevisiae Proteins
Multidisciplinary
Microarray analysis techniques
Genes, Fungal
Magic (programming)
Bayes Theorem
Saccharomyces cerevisiae
Computational biology
Biological Sciences
Biology
Data type
Annotation
Bayes' theorem
ComputingMethodologies_PATTERNRECOGNITION
Genetic Techniques
Protein Interaction Mapping
Critical Assessment of Function Annotation
Functional genomics
Algorithms
Software
Oligonucleotide Array Sequence Analysis
Transcription Factors
Subjects
Details
- ISSN :
- 10916490 and 00278424
- Volume :
- 100
- Database :
- OpenAIRE
- Journal :
- Proceedings of the National Academy of Sciences
- Accession number :
- edsair.doi.dedup.....79415511881035386ccef662824d0927
- Full Text :
- https://doi.org/10.1073/pnas.0832373100