Back to Search Start Over

Using a literature-based NMF model for discovering gene functional relationships.

Authors :
Tjioe, Elina
Berry, Michael
Homayouni, Ramin
Heinrich, Kevin
Source :
BMC Bioinformatics; 2008 Supplement 7, Vol. 9, Special section p1-3, 3p, 2 Color Photographs
Publication Year :
2008

Abstract

Background The rapid growth of the biomedical literature and genomic information present a major challenge for determining the functional relationships among genes. Several bioinformatics tools have been developed to extract and identify gene relationships from various biological databases. In this study, we develop a Web-based bioinformatics tool called Feature Annotation Using Nonnegative matrix factorization (FAUN) to facilitate both the discovery and classification of functional relationships among genes. The algorithms of nonnegative matrix factorization (NMF) described in [1] are used. Both the computational complexity and parameterization of NMF for processing gene sets are discussed. FAUN is first tested on a small manually constructed 50-gene (50TG) collection that we, as well as others, have previously used [2]. The screenshots of FAUN feature classification and gene-to-gene correlation for the 50TG collection are shown in Figures 1 and 2. We then apply FAUN to analyze several microarray-derived gene sets obtained from studies of the developing cerebellum in normal and mutant mice. FAUN provides utilities for collaborative knowledge discovery and identification of new gene relationships from text streams and repositories (e.g. MEDLINE). It is particularly useful for the validation and analysis of gene associations suggested by microarray experimentation. FAUN tool is publicly available at https://shad.eecs.utk.edu/faun. Discussion For a preliminary assessment of FAUN feature classification, each gene in the 50TG collection was classified based on its most dominant annotated feature or based on some feature weight threshold. The FAUN classification using the strongest feature (per gene) yielded 90% accuracy. A FAUN-based analysis of a new cerebellum gene set has revealed new knowledge -- the gene set contains a large component of transcription factors. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
14712105
Volume :
9
Database :
Complementary Index
Journal :
BMC Bioinformatics
Publication Type :
Academic Journal
Accession number :
35552669
Full Text :
https://doi.org/10.1186/1471-2105-9-S7-P1