Predicting Enzyme Class From Protein Structure Without Alignments
- Source :
- Journal of Molecular Biology. 345:187-199
- Publication Year :
- 2005
- Publisher :
- Elsevier BV, 2005.
Abstract
- Methods for predicting protein function from structure are becoming more important as the rate at which structures are solved increases more rapidly than experimental knowledge. As a result, protein structures now frequently lack functional annotations. The majority of methods for predicting protein function are reliant upon identifying a similar protein and transferring its annotations to the query protein. This method fails when a similar protein cannot be identified, or when any similar proteins identified also lack reliable annotations. Here, we describe a method that can assign function from structure without the use of algorithms reliant upon alignments. Using simple attributes that can be calculated from any crystal structure, such as secondary structure content, amino acid propensities, surface properties and ligands, we describe each enzyme in a non-redundant set. The set is split according to Enzyme Classification (EC) number. We combine the predictions of one-class versus one-class support vector machine models to make overall assignments of EC number to an accuracy of 35% with the top-ranked prediction, rising to 60% accuracy with the top two ranks. In doing so we demonstrate the utility of simple structural attributes in protein function prediction and shed light on the link between structure and function. We apply our methods to predict the function of every currently unclassified protein in the Protein Data Bank.
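The combination step described in the abstract can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the abstract does not say how the pairwise predictions are combined, so simple majority voting is assumed here, along with six top-level EC classes. The names `rank_by_votes`, `top_k_accuracy` and `toy_pairwise` are hypothetical, and `toy_pairwise` stands in for trained support vector machine models.

```python
from collections import Counter
from itertools import combinations

# Assumption: predictions are made over the six top-level EC classes.
EC_CLASSES = [1, 2, 3, 4, 5, 6]


def rank_by_votes(pairwise_winners, classes=EC_CLASSES):
    """Rank classes by the number of pairwise contests each one wins.

    pairwise_winners maps a class pair (a, b) to the class chosen by the
    model trained on that pair. Ties are broken by class order.
    """
    votes = Counter({c: 0 for c in classes})
    for winner in pairwise_winners.values():
        votes[winner] += 1
    return sorted(classes, key=lambda c: (-votes[c], classes.index(c)))


def top_k_accuracy(rankings, true_labels, k):
    """Fraction of proteins whose true class is among the top k ranks."""
    hits = sum(1 for r, t in zip(rankings, true_labels) if t in r[:k])
    return hits / len(true_labels)


def toy_pairwise(favoured, classes=EC_CLASSES):
    """Hypothetical stand-in for trained pairwise models.

    The favoured class wins every contest it takes part in; all other
    contests go to the lower-numbered class.
    """
    return {
        (a, b): favoured if favoured in (a, b) else min(a, b)
        for a, b in combinations(classes, 2)
    }


if __name__ == "__main__":
    rankings = [rank_by_votes(toy_pairwise(3)), rank_by_votes(toy_pairwise(2))]
    true_labels = [3, 1]
    print(top_k_accuracy(rankings, true_labels, 1))
    print(top_k_accuracy(rankings, true_labels, 2))
```

In this toy run the second protein is misassigned at rank one but recovered at rank two, which is the kind of gain the abstract reports when moving from the top-ranked prediction to the top two ranks.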
- Subjects :
- Protein structure database
Protein Conformation
Sequence alignment
Computational biology
Protein structure prediction
Biology
Bioinformatics
Protein Data Bank
Enzymes
Structural genomics
Protein structure
Structural Biology
Protein function prediction
Amino Acids
Databases, Protein
Molecular Biology
Protein secondary structure
Algorithms
- ISSN :
- 0022-2836
- Volume :
- 345
- Database :
- OpenAIRE
- Journal :
- Journal of Molecular Biology
- Accession number :
- edsair.doi.dedup.....29865c19bcea1d99ca698ab13e394c3c
- Full Text :
- https://doi.org/10.1016/j.jmb.2004.10.024