Back to Search
Start Over
RASTtk: A modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes
- Source :
- Scientific Reports
- Publication Year :
- 2015
- Publisher :
- Springer Nature, 2015.
-
Abstract
- The RAST (Rapid Annotation using Subsystem Technology) annotation engine was built in 2008 to annotate bacterial and archaeal genomes. It works by offering a standard software pipeline for identifying genomic features (i.e., protein-encoding genes and RNA) and annotating their functions. Recently, in order to make RAST a more useful research tool and to keep pace with advancements in bioinformatics, it has become desirable to build a version of RAST that is both customizable and extensible. In this paper, we describe the RAST tool kit (RASTtk), a modular version of RAST that enables researchers to build custom annotation pipelines. RASTtk offers a choice of software for identifying and annotating genomic features as well as the ability to add custom features to an annotation job. RASTtk also accommodates the batch submission of genomes and the ability to customize annotation protocols for batch submissions. This is the first major software restructuring of RAST since its inception. United States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Service [HHSN272201400027C]; United States Department of Energy, DOE Systems Biology Knowledgebase [DE-AC02-06CH11357]; United States National Science Foundation [CNS-1305112]; Experimental and Computational Determination of Microbial Genotypes and Phenotypes [MCB-1330800]; National Aeronautics and Space Administration through the NASA Astrobiology Institute [NNA13AA91A] We thank Emily Dietrich for her helpful comments. This work was supported by the United States National Institute of Allergy and Infectious Diseases, National Institutes of Health, Department of Health and Human Service [Contract No. HHSN272201400027C]; the United States Department of Energy [DE-AC02-06CH11357], as part of the DOE Systems Biology Knowledgebase; R.A.E. was supported by United States National Science Foundation Grants grants II-EN: Computational Enhancement of Analytical Metagenomics Systems CNS-1305112, and Experimental and Computational Determination of Microbial Genotypes and Phenotypes MCB-1330800; and G.J.O. was supported by the National Aeronautics and Space Administration through the NASA Astrobiology Institute under Cooperative Agreement No. NNA13AA91A issued through the Science Mission Directorate. United States Department of Energy: National Institute of Allergy and Infectious Diseases.
- Subjects :
- Computer science
archaea
system
resource
Extensibility
Genome
Article
World Wide Web
resistance
03 medical and health sciences
Annotation
Resource (project management)
Software
generation
Gene
database
030304 developmental biology
0303 health sciences
microbial genomes
Multidisciplinary
030306 microbiology
business.industry
RNA
Molecular Sequence Annotation
Genome project
Modular design
Pipeline (software)
Identification (information)
ComputingMethodologies_PATTERNRECOGNITION
identification
sequences
rna genes
business
Software engineering
Subjects
Details
- Language :
- English
- Database :
- OpenAIRE
- Journal :
- Scientific Reports
- Accession number :
- edsair.doi.dedup.....055ac3a1956e0c9de6230cd261e6c72f