
Missing gene identification using functional coherence scores.

Authors :
Chitale M
Khan IK
Kihara D
Source :
Scientific reports [Sci Rep] 2016 Aug 24; Vol. 6, pp. 31725. Date of Electronic Publication: 2016 Aug 24.
Publication Year :
2016

Abstract

Reconstructing metabolic and signaling pathways is an effective way of interpreting a genome sequence. A challenge in pathway reconstruction is that genes in a pathway often cannot be easily found, reflecting the currently imperfect information about the target organism. In this work, we developed a new method for finding missing genes that integrates multiple features, including gene expression, phylogenetic profile, and function association scores. In particular, to consider the function association between candidate genes and the proteins neighboring the target missing gene in the network, we used the Co-occurrence Association Score (CAS) and the PubMed Association Score (PAS), which are designed to capture the functional coherence of proteins. We showed that adding CAS and PAS substantially improves the accuracy of identifying missing genes in the yeast enzyme-enzyme network compared with cases in which only the conventional features, gene expression and phylogenetic profile, were used. Finally, we also demonstrated that accuracy improves when indirect neighbors of the target enzyme position in the network are considered using a proper network-topology-based weighting scheme.
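
The ranking idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not give the scoring formula or the weighting scheme, so everything below is an assumption for illustration. The function names (`bfs_distances`, `rank_candidates`), the `pair_score` callback (standing in for some combination of expression, phylogenetic profile, CAS, and PAS scores between a candidate gene and a known enzyme), and the geometric `decay` by hop distance are all hypothetical.

```python
from collections import deque


def bfs_distances(graph, source, max_depth):
    """Return hop counts from source to every other node within max_depth."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        if dist[node] == max_depth:
            continue  # do not expand beyond the depth limit
        for nxt in graph.get(node, ()):
            if nxt not in dist:
                dist[nxt] = dist[node] + 1
                queue.append(nxt)
    del dist[source]
    return dist


def rank_candidates(graph, target, candidates, pair_score, max_depth=2, decay=0.5):
    """Rank candidates by a distance-weighted mean score against network neighbors.

    Assumption: a neighbor at hop distance d gets weight decay ** (d - 1),
    so direct neighbors count fully and indirect ones count less.
    """
    dist = bfs_distances(graph, target, max_depth)
    weights = {node: decay ** (d - 1) for node, d in dist.items()}
    total = sum(weights.values())
    ranked = []
    for cand in candidates:
        if total:
            score = sum(w * pair_score(cand, n) for n, w in weights.items()) / total
        else:
            score = 0.0  # target position has no neighbors in range
        ranked.append((cand, score))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked
```

Setting `max_depth=1` restricts the score to direct neighbors of the target position, while larger values bring in indirect neighbors at reduced weight, which is the kind of comparison the abstract describes.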

Details

Language :
English
ISSN :
2045-2322
Volume :
6
Database :
MEDLINE
Journal :
Scientific reports
Publication Type :
Academic Journal
Accession number :
27552989
Full Text :
https://doi.org/10.1038/srep31725