ToRQuEMaDA: tool for retrieving queried Eubacteria, metadata and dereplicating assemblies
- Source :
- PeerJ, Vol 9, p e11348 (2021)
- Publication Year :
- 2021
- Publisher :
- PeerJ Inc., 2021.
Abstract
- TQMD is a tool which downloads, stores and produces lists of dereplicated prokaryotic genomes. It has been developed to counter the ever-growing number of prokaryotic genomes and their uneven taxonomic distribution. It is based on word-based alignment-free methods (k-mers), an iterative single-linkage approach and a divide-and-conquer strategy to remain both efficient and scalable. We studied the performance of TQMD by verifying the influence of its parameters and heuristics on the clustering outcome. We further compared TQMD to two other dereplication tools (dRep and Assembly-Dereplicator). Our results showed that TQMD is optimized to dereplicate at high taxonomic levels (phylum/class), whereas the other dereplication tools are optimized for lower taxonomic levels (species/strain), making TQMD complementary to the existing dereplicating tools. TQMD is available at <https://bitbucket.org/phylogeno/tqmd>.
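The three ingredients named in the abstract (k-mer comparison, single-linkage clustering, and a divide-and-conquer loop) can be illustrated with a small Python sketch. This is not TQMD's code: the Jaccard distance, the threshold, the batch-doubling schedule, the "keep the longest sequence" rule, and all function names are assumptions chosen only to make the idea concrete.

```python
from itertools import combinations


def kmers(seq, k=4):
    """Return the set of k-length words found in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def jaccard_distance(a, b):
    """1 - |A ∩ B| / |A ∪ B| for two k-mer sets (assumed metric)."""
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def single_linkage(names, profiles, threshold):
    """Group names whose k-mer sets are linked by any pair within threshold."""
    parent = {n: n for n in names}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(names, 2):
        if jaccard_distance(profiles[a], profiles[b]) <= threshold:
            ra, rb = find(a), find(b)
            if ra != rb:
                parent[rb] = ra

    clusters = {}
    for n in names:
        clusters.setdefault(find(n), []).append(n)
    return list(clusters.values())


def dereplicate(genomes, k=4, threshold=0.5, batch_size=2):
    """Hypothetical divide-and-conquer loop: cluster small batches, keep one
    representative per cluster, then repeat with batches twice as large until
    a single batch has covered every survivor."""
    profiles = {name: kmers(seq, k) for name, seq in genomes.items()}
    survivors = sorted(genomes)
    while True:
        reps = []
        for start in range(0, len(survivors), batch_size):
            batch = survivors[start:start + batch_size]
            for cluster in single_linkage(batch, profiles, threshold):
                # Assumed rule: the longest sequence represents its cluster.
                reps.append(max(cluster, key=lambda n: (len(genomes[n]), n)))
        covered_all = batch_size >= len(survivors)
        survivors = sorted(reps)
        if covered_all:
            return survivors
        batch_size *= 2


if __name__ == "__main__":
    toy = {
        "g1": "ACGTACGTACGTACGT",
        "g2": "TTTTGGGGCCCC",
        "g3": "ACGTACGTACGA",
    }
    print(dereplicate(toy))
```

Batching keeps each round's pairwise comparisons small, at the cost that near-duplicates sitting in different batches are only merged in a later round. Because later rounds compare representatives rather than all members, the result is a heuristic and need not equal a full single-linkage clustering.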
- Subjects :
- Alignment-free methods
Information retrieval
Singularity
Bioinformatics
Phylum
Computer science
Strain (biology)
Phylogenomics
Genomics
NCBI RefSeq
Dereplication
Microbiology
Metadata
Genome selection
Scalability
Medicine
Metagenomics
Taxonomic rank
Prokaryotes
Heuristics
Cluster analysis
Word (computer architecture)
Taxonomy
Genome quality
Details
- Language :
- English
- ISSN :
- 2167-8359
- Volume :
- 9
- Database :
- OpenAIRE
- Journal :
- PeerJ
- Accession number :
- edsair.doi.dedup.....ef9bc1de2131e1626a30991bd942a9a4