Back to Search Start Over

Roles for text mining in protein function prediction.

Authors :
Verspoor KM
Source :
Methods in molecular biology (Clifton, N.J.) [Methods Mol Biol] 2014; Vol. 1159, pp. 95-108.
Publication Year :
2014

Abstract

The Human Genome Project has provided science with a hugely valuable resource: the blueprints for life; the specification of all of the genes that make up a human. While the genes have all been identified and deciphered, it is proteins that are the workhorses of the human body: they are essential to virtually all cell functions and are the primary mechanism through which biological function is carried out. Hence in order to fully understand what happens at a molecular level in biological organisms, and eventually to enable development of treatments for diseases where some aspect of a biological system goes awry, we must understand the functions of proteins. However, experimental characterization of protein function cannot scale to the vast amount of DNA sequence data now available. Computational protein function prediction has therefore emerged as a problem at the forefront of modern biology (Radivojac et al., Nat Methods 10(13):221-227, 2013).Within the varied approaches to computational protein function prediction that have been explored, there are several that make use of biomedical literature mining. These methods take advantage of information in the published literature to associate specific proteins with specific protein functions. In this chapter, we introduce two main strategies for doing this: association of function terms, represented as Gene Ontology terms (Ashburner et al., Nat Genet 25(1):25-29, 2000), to proteins based on information in published articles, and a paradigm called LEAP-FS (Literature-Enhanced Automated Prediction of Functional Sites) in which literature mining is used to validate the predictions of an orthogonal computational protein function prediction method.

Details

Language :
English
ISSN :
1940-6029
Volume :
1159
Database :
MEDLINE
Journal :
Methods in molecular biology (Clifton, N.J.)
Publication Type :
Academic Journal
Accession number :
24788263
Full Text :
https://doi.org/10.1007/978-1-4939-0709-0_6