Back to Search Start Over

Minirmd: accurate and fast duplicate removal tool for short reads via multiple minimizers.

Authors :
Liu, Yuansheng
Zhang, Xiaocai
Zou, Quan
Zeng, Xiangxiang
Source :
Bioinformatics. Jun2021, Vol. 37 Issue 11, p1604-1606. 3p.
Publication Year :
2021

Abstract

Summary Removing duplicate and near-duplicate reads, generated by high-throughput sequencing technologies, is able to reduce computational resources in downstream applications. Here we develop minirmd, a de novo tool to remove duplicate reads via multiple rounds of clustering using different length of minimizer. Experiments demonstrate that minirmd removes more near-duplicate reads than existing clustering approaches and is faster than existing multi-core tools. To the best of our knowledge, minirmd is the first tool to remove near-duplicates on reverse-complementary strand. Availability and implementation https://github.com/yuansliu/minirmd. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]

Subjects

Subjects :
*NUCLEOTIDE sequencing
*READING

Details

Language :
English
ISSN :
13674803
Volume :
37
Issue :
11
Database :
Academic Search Index
Journal :
Bioinformatics
Publication Type :
Academic Journal
Accession number :
151369030
Full Text :
https://doi.org/10.1093/bioinformatics/btaa915