Back to Search
Start Over
CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences
- Source :
- BMC Genomics, BMC Genomics, Vol 13, Iss 1, p 715 (2012)
- Publisher :
- Springer Nature
-
Abstract
- Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.
- Subjects :
- Web server
lcsh:QH426-470
Inverted repeat
lcsh:Biotechnology
Annotation
Statistics as Topic
Computational biology
Biology
computer.software_genre
Genome
Species Specificity
lcsh:TP248.13-248.65
Databases, Genetic
Genetics
Chloroplast genome
Genome, Chloroplast
Whole genome sequencing
Internet
CPGAVAS
food and beverages
Molecular Sequence Annotation
lcsh:Genetics
GenBank
DNA microarray
Databases, Nucleic Acid
computer
Software
Biotechnology
Subjects
Details
- Language :
- English
- ISSN :
- 14712164
- Volume :
- 13
- Issue :
- 1
- Database :
- OpenAIRE
- Journal :
- BMC Genomics
- Accession number :
- edsair.doi.dedup.....de89987b421287a36e92f1c862b0eff4
- Full Text :
- https://doi.org/10.1186/1471-2164-13-715