Knowledge-based voting algorithm for automated protein functional annotation
- Source :
- Proteins: Structure, Function, and Bioinformatics. 61:907-917
- Publication Year :
- 2005
- Publisher :
- Wiley, 2005.
Abstract
- Automated annotation of high-throughput genome sequences is one of the earliest steps toward a comprehensive understanding of the dynamic behavior of living organisms. However, the step is often error-prone because of its underlying algorithms, which rely mainly on a simple similarity analysis, and lack of guidance from biological rules. We present herein a knowledge-based protein annotation algorithm. Our objectives are to reduce errors and to improve annotation confidences. This algorithm consists of two major components: a knowledge system, called “RuleMiner,” and a voting procedure. The knowledge system, which includes biological rules and functional profiles for each function, provides a platform for seamless integration of multiple sequence analysis tools and guidance for function annotation. The voting procedure, which relies on the knowledge system, is designed to make (possibly) unbiased judgments in functional assignments among complicated, sometimes conflicting, information. We have applied this algorithm to 10 prokaryotic bacterial genomes and observed a significant improvement in annotation confidences. We also discuss the current limitations of the algorithm and the potential for future improvement. Proteins 2005. © 2005 Wiley-Liss, Inc.
- Subjects :
- Computer science
Machine learning
Biochemistry
Automation
Annotation
Protein Annotation
Structural Biology
Voting
Escherichia coli
Protein function prediction
Amino Acid Sequence
Critical Assessment of Function Annotation
Function (engineering)
Molecular Biology
Escherichia coli Proteins
Proteins
Functional annotation
Voting algorithm
Data mining
Artificial intelligence
Algorithms
Genome, Bacterial
Details
- ISSN :
- 0887-3585
- Volume :
- 61
- Database :
- OpenAIRE
- Journal :
- Proteins: Structure, Function, and Bioinformatics
- Accession number :
- edsair.doi.dedup.....ff4884446991f29496f7b065a814705e
- Full Text :
- https://doi.org/10.1002/prot.20652
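
Illustrative sketch

The abstract describes a voting procedure that combines the outputs of several sequence analysis tools under the guidance of biological rules. The record gives no implementation details, so the following is only a minimal sketch of one generic way such a vote could work, not the method used in the paper or in RuleMiner. Every name here (`vote`, `tool_weights`, `rules`, `min_share`), the weighting scheme, and the example tools and labels are assumptions made for illustration.

```python
from collections import defaultdict


def vote(predictions, tool_weights=None, rules=None, min_share=0.5):
    """Pick a function label by weighted voting (hypothetical scheme).

    predictions  : list of (tool_name, function_label, score) tuples,
                   with score assumed to lie in [0, 1]
    tool_weights : optional dict mapping tool_name -> weight (default 1.0)
    rules        : optional list of callables label -> bool; a label that
                   fails any rule is discarded before the vote
    min_share    : minimum fraction of the total vote the winner must hold

    Returns (label, share); label is None when no label is confident enough.
    """
    tool_weights = tool_weights or {}
    rules = rules or []
    tally = defaultdict(float)

    for tool, label, score in predictions:
        # Assumed stand-in for "biological rules": drop labels a rule rejects.
        if not all(rule(label) for rule in rules):
            continue
        tally[label] += tool_weights.get(tool, 1.0) * score

    total = sum(tally.values())
    if total <= 0:
        return None, 0.0

    best = max(tally, key=tally.get)
    share = tally[best] / total  # used here as a crude confidence value
    if share < min_share:
        return None, share
    return best, share


if __name__ == "__main__":
    # Made-up tool names, labels, and scores, for demonstration only.
    hits = [
        ("similarity_search", "kinase", 0.9),
        ("profile_search", "kinase", 0.8),
        ("motif_scan", "phosphatase", 0.6),
    ]
    print(vote(hits))
```

In this sketch the winning label's share of the total weighted vote doubles as a rough confidence value, and conflicting evidence that leaves no label above `min_share` yields no assignment rather than a forced one.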