Back to Search Start Over

B-assembler: a circular bacterial genome assembler.

Authors :
Huang F
Xiao L
Gao M
Vallely EJ
Dybvig K
Atkinson TP
Waites KB
Chong Z
Source :
BMC genomics [BMC Genomics] 2022 May 11; Vol. 23 (Suppl 4), pp. 361. Date of Electronic Publication: 2022 May 11.
Publication Year :
2022

Abstract

Background: Accurate bacteria genome de novo assembly is fundamental to understand the evolution and pathogenesis of new bacteria species. The advent and popularity of Third-Generation Sequencing (TGS) enables assembly of bacteria genomes at an unprecedented speed. However, most current TGS assemblers were specifically designed for human or other species that do not have a circular genome. Besides, the repetitive DNA fragments in many bacterial genomes plus the high error rate of long sequencing data make it still very challenging to accurately assemble their genomes even with a relatively small genome size. Therefore, there is an urgent need for the development of an optimized method to address these issues.<br />Results: We developed B-assembler, which is capable of assembling bacterial genomes when there are only long reads or a combination of short and long reads. B-assembler takes advantage of the structural resolving power of long reads and the accuracy of short reads if applicable. It first selects and corrects the ultra-long reads to get an initial contig. Then, it collects the reads overlapping with the ends of the initial contig. This two-round assembling procedure along with optimized error correction enables a high-confidence and circularized genome assembly. Benchmarked on both synthetic and real sequencing data of several species of bacterium, the results show that both long-read-only and hybrid-read modes can accurately assemble circular bacterial genomes free of structural errors and have fewer small errors compared to other assemblers.<br />Conclusions: B-assembler provides a better solution to bacterial genome assembly, which will facilitate downstream bacterial genome analysis.<br /> (© 2022. The Author(s).)

Details

Language :
English
ISSN :
1471-2164
Volume :
23
Issue :
Suppl 4
Database :
MEDLINE
Journal :
BMC genomics
Publication Type :
Academic Journal
Accession number :
35546658
Full Text :
https://doi.org/10.1186/s12864-022-08577-7