Minirmd: accurate and fast duplicate removal tool for short reads via multiple minimizers.
- Source :
- Bioinformatics. Jun 2021, Vol. 37 Issue 11, p1604-1606. 3p.
- Publication Year :
- 2021
- Abstract :
- Summary: Removing duplicate and near-duplicate reads generated by high-throughput sequencing technologies can reduce the computational resources required by downstream applications. Here we develop minirmd, a de novo tool that removes duplicate reads via multiple rounds of clustering using minimizers of different lengths. Experiments demonstrate that minirmd removes more near-duplicate reads than existing clustering approaches and is faster than existing multi-core tools. To the best of our knowledge, minirmd is the first tool to remove near-duplicates on the reverse-complementary strand. Availability and implementation: https://github.com/yuansliu/minirmd. Supplementary information: Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]
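The idea summarized in the abstract (bucket reads by a minimizer, compare reads only within a bucket, and repeat with a different minimizer length) can be illustrated with a minimal sketch. This is not minirmd's implementation: the function names, the greedy representative selection, the Hamming-distance test, and the default values of `ks` and `max_mismatches` are all assumptions made for illustration. Strand handling here uses canonical k-mers, so a read and its reverse complement land in the same bucket.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


def canonical_minimizer(read, k):
    """Lexicographically smallest k-mer of the read, taken over both strands.

    Returns None when the read is shorter than k.
    """
    best = None
    for i in range(len(read) - k + 1):
        kmer = read[i:i + k]
        canon = min(kmer, reverse_complement(kmer))
        if best is None or canon < best:
            best = canon
    return best


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def is_near_duplicate(a, b, max_mismatches):
    """True if b matches a, or a's strand-flipped form, within the threshold."""
    if len(a) != len(b):
        return False
    return (hamming(a, b) <= max_mismatches
            or hamming(a, reverse_complement(b)) <= max_mismatches)


def remove_near_duplicates(reads, ks=(12, 8), max_mismatches=1):
    """Hypothetical multi-round filter: one clustering round per value in ks."""
    kept = list(range(len(reads)))
    for k in ks:
        # Bucket the surviving reads by their canonical minimizer of length k.
        buckets = defaultdict(list)
        for idx in kept:
            buckets[canonical_minimizer(reads[idx], k)].append(idx)
        survivors = []
        for members in buckets.values():
            # Greedy pass: keep a read only if no kept read in the same
            # bucket is a near-duplicate of it.
            representatives = []
            for idx in members:
                if not any(is_near_duplicate(reads[r], reads[idx], max_mismatches)
                           for r in representatives):
                    representatives.append(idx)
            survivors.extend(representatives)
        kept = sorted(survivors)
    return [reads[i] for i in kept]
```

In this sketch, a second round with a different `k` gives near-duplicate pairs another chance to share a bucket when a mismatch happened to change the minimizer in an earlier round. For example, calling `remove_near_duplicates(["ACGTACGTAA", "ACGTACGTAA", "TTACGTACGT", "ACGTACGTAC", "GGGGGTTTTT"], ks=(5, 3))` keeps only the first and last reads: the others are an exact copy, the reverse complement, and a one-mismatch variant of the first.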
- Subjects :
- *NUCLEOTIDE sequencing
- *READING
Details
- Language :
- English
- ISSN :
- 1367-4803
- Volume :
- 37
- Issue :
- 11
- Database :
- Academic Search Index
- Journal :
- Bioinformatics
- Publication Type :
- Academic Journal
- Accession number :
- 151369030
- Full Text :
- https://doi.org/10.1093/bioinformatics/btaa915