Back to Search Start Over

Fixed Block Compression Boosting in FM-Indexes: Theory and Practice.

Authors :
Gog, Simon
Kärkkäinen, Juha
Kempa, Dominik
Petri, Matthias
Puglisi, Simon J.
Source :
Algorithmica. Apr2019, Vol. 81 Issue 4, p1370-1391. 22p.
Publication Year :
2019

Abstract

The FM index (Ferragina and Manzini in J ACM 52(4):552–581, 2005) is a widely-used compressed data structure that stores a string T in a compressed form and also supports fast pattern matching queries. In this paper, we describe new FM-index variants that combine nice theoretical properties, simple implementation and improved practical performance. Our main theoretical result is a new technique called fixed block compression boosting, which is a simpler and faster alternative to optimal compression boosting and implicit compression boosting used in previous FM-indexes. We also describe several new techniques for implementing fixed-block boosting efficiently, including a new, fast, and space-efficient implementation of wavelet trees. Our extensive experiments show the new indexes to be consistently fast and small relative to the state-of-the-art, and thus they make a good "off-the-shelf" choice for many applications. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01784617
Volume :
81
Issue :
4
Database :
Academic Search Index
Journal :
Algorithmica
Publication Type :
Academic Journal
Accession number :
135753099
Full Text :
https://doi.org/10.1007/s00453-018-0475-9