
ToRQuEMaDA: tool for retrieving queried Eubacteria, metadata and dereplicating assemblies

Authors :
Marie Leleu
Denis Baurain
Frédéric Kerff
Mick Van Vlierberghe
Raphaël R. Léonard
Source :
PeerJ, Vol 9, p e11348 (2021), PeerJ
Publication Year :
2021
Publisher :
PeerJ Inc., 2021.

Abstract

ToRQuEMaDA (TQMD) is a tool that downloads and stores prokaryotic genomes and produces lists of dereplicated genomes from them. It was developed to cope with the ever-growing number of prokaryotic genomes and their uneven taxonomic distribution. To remain both efficient and scalable, it combines word-based alignment-free methods (k-mers), an iterative single-linkage approach and a divide-and-conquer strategy. We studied the performance of TQMD by assessing the influence of its parameters and heuristics on the clustering outcome. We further compared TQMD to two other dereplication tools (dRep and Assembly-Dereplicator). Our results showed that TQMD is optimized to dereplicate at high taxonomic levels (phylum/class), whereas the other dereplication tools are optimized for lower taxonomic levels (species/strain), making TQMD complementary to the existing dereplication tools. TQMD is available at <https://bitbucket.org/phylogeno/tqmd>.
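
The three ingredients named in the abstract (k-mer comparison, iterative single-linkage clustering, divide-and-conquer batching) can be illustrated with a minimal Python sketch. This is not TQMD's implementation: the function names, the Jaccard distance, the parameter values (`k`, `threshold`, `batch_size`) and the rule for picking a cluster representative are all assumptions chosen for illustration.

```python
from itertools import combinations


def kmers(seq, k=4):
    """Return the set of all overlapping k-mers (words of length k) in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(a, b):
    """Distance between two k-mer sets: 1 - |A & B| / |A | B|."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def single_linkage(profiles, threshold):
    """Cluster names linked by a chain of pairs with distance <= threshold."""
    parent = {name: name for name in profiles}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(profiles, 2):
        if jaccard_distance(profiles[a], profiles[b]) <= threshold:
            parent[find(a)] = find(b)

    clusters = {}
    for name in profiles:
        clusters.setdefault(find(name), []).append(name)
    return list(clusters.values())


def dereplicate(genomes, k=4, threshold=0.5, batch_size=3):
    """Dereplicate batch by batch, repeating until the list stops shrinking."""
    profiles = {name: kmers(seq, k) for name, seq in genomes.items()}
    names = sorted(profiles)
    while True:
        kept = []
        # Divide and conquer: only genomes within the same batch are compared.
        for start in range(0, len(names), batch_size):
            batch = {n: profiles[n] for n in names[start:start + batch_size]}
            for cluster in single_linkage(batch, threshold):
                # Assumed rule: keep the genome with the most distinct k-mers,
                # breaking ties by name.
                kept.append(max(cluster, key=lambda n: (len(profiles[n]), n)))
        kept.sort()
        if len(kept) == len(names):
            return kept
        names = kept
```

Calling `dereplicate` on a dictionary that maps genome names to sequences returns the sorted names of the retained representatives. Batching keeps the number of pairwise comparisons per round small, at the cost that two similar genomes are merged only if some round places them in the same batch; a real tool needs a strategy for that case, which this sketch does not attempt.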

Details

Language :
English
ISSN :
2167-8359
Volume :
9
Database :
OpenAIRE
Journal :
PeerJ
Accession number :
edsair.doi.dedup.....ef9bc1de2131e1626a30991bd942a9a4