Back to Search
Start Over
ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities.
- Source :
- Advances in Bioinformatics; 2011, p1-12, 12p
- Publication Year :
- 2011
-
Abstract
- Various enzyme identification protocols involving homology transfer by sequence-sequence or profile-sequence comparisons have been devised which utilise Swiss-Prot sequences associated with EC numbers as the training set. A profile HMM constructed for a particular EC number might select sequences which perform a different enzymatic function due to the presence of certain fold-specific residues which are conserved in enzymes sharing a common fold. We describe a protocol, ModEnzA (HMM-ModE Enzyme Annotation), which generates profile HMMs highly specific at a functional level as defined by the EC numbers by incorporating information from negative training sequences. We enrich the training dataset by mining sequences from the NCBI Non-Redundant database for increased sensitivity. We compare our method with other enzyme identification methods, both for assigning EC numbers to a genome as well as identifying protein sequences associated with an enzymatic activity. We report a sensitivity of 88% and specificity of 95% in identifying EC numbers and annotating enzymatic sequences fromthe E. coli genome which is higher than any othermethod.With the next-generation sequencingmethods producing a huge amount of sequence data, the development and use of fully automated yet accurate protocols such as ModEnzA is warranted for rapid annotation of newly sequenced genomes and metagenomic sequences. [ABSTRACT FROM AUTHOR]
- Subjects :
- METABOLISM
ENZYMES
GENOMES
PROTEIN folding
PROTEIN conformation
ESCHERICHIA coli
Subjects
Details
- Language :
- English
- ISSN :
- 16878027
- Database :
- Complementary Index
- Journal :
- Advances in Bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 71491739
- Full Text :
- https://doi.org/10.1155/2011/743782