Back to Search Start Over

De novo meta-assembly of ultra-deep sequencing data

Authors :
Timothy J. Close
Stefano Lonardi
Hamid Mirebrahim
Source :
Bioinformatics, Bioinformatics (Oxford, England), vol 31, iss 12
Publication Year :
2015
Publisher :
Oxford University Press, 2015.

Abstract

We introduce a new divide and conquer approach to deal with the problem of de novo genome assembly in the presence of ultra-deep sequencing data (i.e. coverage of 1000x or higher). Our proposed meta-assembler Slicembler partitions the input data into optimal-sized ‘slices’ and uses a standard assembly tool (e.g. Velvet, SPAdes, IDBA_UD and Ray) to assemble each slice individually. Slicembler uses majority voting among the individual assemblies to identify long contigs that can be merged to the consensus assembly. To improve its efficiency, Slicembler uses a generalized suffix tree to identify these frequent contigs (or fraction thereof). Extensive experimental results on real ultra-deep sequencing data (8000x coverage) and simulated data show that Slicembler significantly improves the quality of the assembly compared with the performance of the base assembler. In fact, most of the times, Slicembler generates error-free assemblies. We also show that Slicembler is much more resistant against high sequencing error rate than the base assembler. Availability and implementation: Slicembler can be accessed at http://slicembler.cs.ucr.edu/. Contact: hamid.mirebrahim@email.ucr.edu

Details

Language :
English
ISSN :
13674811 and 13674803
Volume :
31
Issue :
12
Database :
OpenAIRE
Journal :
Bioinformatics
Accession number :
edsair.doi.dedup.....320fb465254a7d8221c7b61998c9184b