Back to Search Start Over

StrainSeeker: fast identification of bacterial strains from raw sequencing reads using user-provided guide trees.

Authors :
Roosaare M
Vaher M
Kaplinski L
Möls M
Andreson R
Lepamets M
Kõressaar T
Naaber P
Kõljalg S
Remm M
Source :
PeerJ [PeerJ] 2017 May 18; Vol. 5, pp. e3353. Date of Electronic Publication: 2017 May 18 (Print Publication: 2017).
Publication Year :
2017

Abstract

Background: Fast, accurate and high-throughput identification of bacterial isolates is in great demand. The present work was conducted to investigate the possibility of identifying isolates from unassembled next-generation sequencing reads using custom-made guide trees.<br />Results: A tool named StrainSeeker was developed that constructs a list of specific k -mers for each node of any given Newick-format tree and enables the identification of bacterial isolates in 1-2 min. It uses a novel algorithm, which analyses the observed and expected fractions of node-specific k -mers to test the presence of each node in the sample. This allows StrainSeeker to determine where the isolate branches off the guide tree and assign it to a clade whereas other tools assign each read to a reference genome. Using a dataset of 100 Escherichia coli isolates, we demonstrate that StrainSeeker can predict the clades of E. coli with 92% accuracy and correct tree branch assignment with 98% accuracy. Twenty-five thousand Illumina HiSeq reads are sufficient for identification of the strain.<br />Conclusion: StrainSeeker is a software program that identifies bacterial isolates by assigning them to nodes or leaves of a custom-made guide tree. StrainSeeker's web interface and pre-computed guide trees are available at http://bioinfo.ut.ee/strainseeker. Source code is stored at GitHub: https://github.com/bioinfo-ut/StrainSeeker.<br />Competing Interests: Paul Naaber is an employee of Synlab Eesti, Tallinn, Estonia.

Details

Language :
English
ISSN :
2167-8359
Volume :
5
Database :
MEDLINE
Journal :
PeerJ
Publication Type :
Academic Journal
Accession number :
28533988
Full Text :
https://doi.org/10.7717/peerj.3353