Back to Search Start Over

RNACache: A scalable approach to rapid transcriptomic read mapping using locality sensitive hashing.

Authors :
Cascitti, Julian
Niebler, Stefan
Müller, André
Schmidt, Bertil
Source :
Journal of Computational Science; Apr2022, Vol. 60, pN.PAG-N.PAG, 1p
Publication Year :
2022

Abstract

Mapping of reads to transcriptomes is a crucial initial step for bioinformatics RNA-seq pipelines. As alignment-based methods exhibit high computational complexities, lightweight alignment-free methods are becoming increasingly important. We present RNACache – a novel approach to the detection of local similarities between transcriptomes and RNA-seq reads based on context-aware locality sensitive hashing. It consists of a three-step processing pipeline consisting of subsampling of k -mers, match-based (online) filtering, and coverage-based filtering in order to identify truly expressed transcript isoforms. Our performance evaluation shows that RNACache produces transcriptomic mappings of high accuracy that include significantly fewer erroneous matches compared to the state-of-the-art lightweight mappers RapMap, Salmon, and Kallisto. Furthermore, it offers good scalability in terms of number of utilized CPU cores and has the best runtime performance at low memory consumption on modern multi-core workstations. This is an extended version of our previously published conference paper (Cascitti et al., 2021). RNACache is available at https://github.com/jcasc/rnacache. • An LSH-based approach can accurately map RNA-seq reads to transcriptomes. • Unexpressed Isoforms can be distinguished by target coverage. • LSH-based mapping is much faster than alignments and scalable. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
18777503
Volume :
60
Database :
Supplemental Index
Journal :
Journal of Computational Science
Publication Type :
Periodical
Accession number :
156452357
Full Text :
https://doi.org/10.1016/j.jocs.2022.101572