Back to Search Start Over

Virome Assembly and Annotation: A Surprise in the Namib Desert

Authors :
Uljana Hesse
Peter van Heusden
Israel Olonade
Leonardo Joaquim van Zyl
Bronwyn M. Kirby
Marla Trindade
Source :
Frontiers in Microbiology
Publication Year :
2017
Publisher :
Frontiers Media S.A., 2017.

Abstract

Sequencing, assembly, and annotation of environmental virome samples is challenging. Methodological biases and differences in species abundance result in fragmentary read coverage; sequence reconstruction is further complicated by the mosaic nature of viral genomes. In this paper, we focus on biocomputational aspects of virome analysis, emphasizing latent pitfalls in sequence annotation. Using simulated viromes that mimic environmental data challenges we assessed the performance of five assemblers (CLC-Workbench, IDBA-UD, SPAdes, RayMeta, ABySS). Individual analyses of relevant scaffold length fractions revealed shortcomings of some programs in reconstruction of viral genomes with excessive read coverage (IDBA-UD, RayMeta), and in accurate assembly of scaffolds ≥50 kb (SPAdes, RayMeta, ABySS). The CLC-Workbench assembler performed best in terms of genome recovery (including highly covered genomes) and correct reconstruction of large scaffolds; and was used to assemble a virome from a copper rich site in the Namib Desert. We found that scaffold network analysis and cluster-specific read reassembly improved reconstruction of sequences with excessive read coverage, and that strict data filtering for non-viral sequences prior to downstream analyses was essential. In this study we describe novel viral genomes identified in the Namib Desert copper site virome. Taxonomic affiliations of diverse proteins in the dataset and phylogenetic analyses of circovirus-like proteins indicated links to the marine habitat. Considering additional evidence from this dataset we hypothesize that viruses may have been carried from the Atlantic Ocean into the Namib Desert by fog and wind, highlighting the impact of the extended environment on an investigated niche in metagenome studies.

Details

Language :
English
ISSN :
1664302X
Volume :
8
Database :
OpenAIRE
Journal :
Frontiers in Microbiology
Accession number :
edsair.doi.dedup.....c63b990411b4a30a475b46cf0748a64e
Full Text :
https://doi.org/10.3389/fmicb.2017.00013