Back to Search Start Over

Functional cohesion of gene sets determined by latent semantic indexing of PubMed abstracts.

Authors :
Lijing Xu
Nicholas Furlotte
Yunyue Lin
Kevin Heinrich
Michael W Berry
Ebenezer O George
Ramin Homayouni
Source :
PLoS ONE, Vol 6, Iss 4, p e18851 (2011)
Publication Year :
2011
Publisher :
Public Library of Science (PLoS), 2011.

Abstract

High-throughput genomic technologies enable researchers to identify genes that are co-regulated with respect to specific experimental conditions. Numerous statistical approaches have been developed to identify differentially expressed genes. Because each approach can produce distinct gene sets, it is difficult for biologists to determine which statistical approach yields biologically relevant gene sets and is appropriate for their study. To address this issue, we implemented Latent Semantic Indexing (LSI) to determine the functional coherence of gene sets. An LSI model was built using over 1 million Medline abstracts for over 20,000 mouse and human genes annotated in Entrez Gene. The gene-to-gene LSI-derived similarities were used to calculate a literature cohesion p-value (LPv) for a given gene set using a Fisher's exact test. We tested this method against genes in more than 6,000 functional pathways annotated in Gene Ontology (GO) and found that approximately 75% of gene sets in GO biological process category and 90% of the gene sets in GO molecular function and cellular component categories were functionally cohesive (LPv

Subjects

Subjects :
Medicine
Science

Details

Language :
English
ISSN :
19326203
Volume :
6
Issue :
4
Database :
Directory of Open Access Journals
Journal :
PLoS ONE
Publication Type :
Academic Journal
Accession number :
edsdoj.4776e0f82cda469f9745a98cdb54528a
Document Type :
article
Full Text :
https://doi.org/10.1371/journal.pone.0018851