Back to Search Start Over

ModEnzA: Accurate Identification of Metabolic Enzymes Using Function Specific Profile HMMs with Optimised Discrimination Threshold and Modified Emission Probabilities.

Authors :
Desai, Dhwani K.
Nandi, Soumyadeep
Srivastava, Prashant K.
Lynn, Andrew M.
Source :
Advances in Bioinformatics; 2011, p1-12, 12p
Publication Year :
2011

Abstract

Various enzyme identification protocols involving homology transfer by sequence-sequence or profile-sequence comparisons have been devised which utilise Swiss-Prot sequences associated with EC numbers as the training set. A profile HMM constructed for a particular EC number might select sequences which perform a different enzymatic function due to the presence of certain fold-specific residues which are conserved in enzymes sharing a common fold. We describe a protocol, ModEnzA (HMM-ModE Enzyme Annotation), which generates profile HMMs highly specific at a functional level as defined by the EC numbers by incorporating information from negative training sequences. We enrich the training dataset by mining sequences from the NCBI Non-Redundant database for increased sensitivity. We compare our method with other enzyme identification methods, both for assigning EC numbers to a genome as well as identifying protein sequences associated with an enzymatic activity. We report a sensitivity of 88% and specificity of 95% in identifying EC numbers and annotating enzymatic sequences fromthe E. coli genome which is higher than any othermethod.With the next-generation sequencingmethods producing a huge amount of sequence data, the development and use of fully automated yet accurate protocols such as ModEnzA is warranted for rapid annotation of newly sequenced genomes and metagenomic sequences. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
16878027
Database :
Complementary Index
Journal :
Advances in Bioinformatics
Publication Type :
Academic Journal
Accession number :
71491739
Full Text :
https://doi.org/10.1155/2011/743782