1. MannDB - a microbial database of automated protein sequence analyses and evidence integration for protein characterization.
- Author
- Zhou CL, Lam MW, Smith JR, Zemla AT, Dyer MD, Kuczmarski TA, Vitalis EA, and Slezak TR
- Subjects
- Algorithms; Amino Acid Sequence; Bacterial Proteins classification; Bacterial Proteins genetics; Binding Sites; Computer Graphics; Database Management Systems; Internet; Molecular Sequence Data; Protein Binding; Proteome chemistry; Proteome classification; Proteome genetics; Proteome metabolism; Software; Systems Integration; Bacterial Proteins chemistry; Bacterial Proteins metabolism; Databases, Protein; Information Storage and Retrieval methods; Sequence Alignment methods; Sequence Analysis, Protein methods; User-Computer Interface
- Abstract
Background: MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data.
Description: MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO.
Conclusion: MannDB comprises a large number of genomes and comprehensive protein sequence analyses representing organisms listed as high-priority agents on the websites of several governmental organizations concerned with bio-terrorism. MannDB provides the user with a BLAST interface for comparison of native and non-native sequences and a query tool for conveniently selecting proteins of interest. In addition, the user has access to a web-based browser that compiles comprehensive and extensive reports. Access to MannDB is freely available at http://manndb.llnl.gov/.
- Published
- 2006