Back to Search
Start Over
Vgas: A Viral Genome Annotation System
Vgas: A Viral Genome Annotation System
- Source :
- Frontiers in Microbiology, Frontiers in Microbiology, Vol 10 (2019)
- Publication Year :
- 2019
- Publisher :
- Frontiers Media S.A., 2019.
-
Abstract
- The in-depth study of viral genomes is of great help in many aspects, especially in the treatment of human diseases caused by viral infections. With the rapid accumulation of viral sequencing data, improved, or alternative gene-finding systems have become necessary to process and mine these data. In this article, we present Vgas, a system combining an ab initio method and a similarity-based method to automatically find viral genes and perform gene function annotation. Vgas was compared with existing programs, such as Prodigal, GeneMarkS, and Glimmer. Through testing 5,705 virus genomes downloaded from RefSeq, Vgas demonstrated its superiority with the highest average precision and recall (both indexes were 1% higher or more than the other programs); particularly for small virus genomes (≤ 10 kb), it showed significantly improved performance (precision was 6% higher, and recall was 2% higher). Moreover, Vgas presents an annotation module to provide functional information for predicted genes based on BLASTp alignment. This characteristic may be specifically useful in some cases. When combining Vgas with GeneMarkS and Prodigal, better prediction results could be obtained than with each of the three individual programs, suggesting that collaborative prediction using several different software programs is an alternative for gene prediction. Vgas is freely available at http://cefg.uestc.cn/vgas/ or http://121.48.162.133/vgas/. We hope that Vgas could be an alternative virus gene finder to annotate new genomes or reannotate existing genome.
- Subjects :
- Microbiology (medical)
0303 health sciences
030306 microbiology
Computer science
Gene prediction
lcsh:QR1-502
Computational biology
Genome project
function annotation
joint application of multiple programs
Genome
Microbiology
lcsh:Microbiology
03 medical and health sciences
Annotation
Improved performance
Vgas
RefSeq
Methods
virus gene prediction
Precision and recall
Gene
novel genes
030304 developmental biology
Subjects
Details
- Language :
- English
- ISSN :
- 1664302X
- Volume :
- 10
- Database :
- OpenAIRE
- Journal :
- Frontiers in Microbiology
- Accession number :
- edsair.doi.dedup.....63c6941544bdeef19ac91de7cc632d77