Aziz, Ramy K., Bartels, Daniela, Best, Aaron A., DeJongh, Matthew, Disz, Terrence, Edwards, Robert A., Formsma, Kevin, Gerdes, Svetlana, Glass, Elizabeth M., Kuba, Michael, Meyer, Folker, Olsen, Gary J., Olson, Robert, Osterman, Andrei L., Overbeek, Ross A., McNeil, Leslie K., Paarmann, Daniel, Paczian, Tobias, Parrello, Bruce, and Pusch, Gordon D.
Background: The number of prokaryotic genome sequences becoming available is growing steadily and is growing faster than our ability to accurately annotate them. Description: We describe a fully automated service for annotating bacterial and archaeal genomes. The service identifies protein-encoding, rRNA and tRNA genes, assigns functions to the genes, predicts which subsystems are represented in the genome, uses this information to reconstruct the metabolic network and makes the output easily downloadable for the user. In addition, the annotated genome can be browsed in an environment that supports comparative analysis with the annotated genomes maintained in the SEED environment. The service normally makes the annotated genome available within 12-24 hours of submission, but ultimately the quality of such a service will be judged in terms of accuracy, consistency, and completeness of the produced annotations. We summarize our attempts to address these issues and discuss plans for incrementally enhancing the service. Conclusion: By providing accurate, rapid annotation freely to the community we have created an important community resource. The service has now been utilized by over 120 external users annotating over 350 distinct genomes. [ABSTRACT FROM AUTHOR]