Back to Search
Start Over
Fixed Block Compression Boosting in FM-Indexes: Theory and Practice.
- Source :
-
Algorithmica . Apr2019, Vol. 81 Issue 4, p1370-1391. 22p. - Publication Year :
- 2019
-
Abstract
- The FM index (Ferragina and Manzini in J ACM 52(4):552–581, 2005) is a widely-used compressed data structure that stores a string T in a compressed form and also supports fast pattern matching queries. In this paper, we describe new FM-index variants that combine nice theoretical properties, simple implementation and improved practical performance. Our main theoretical result is a new technique called fixed block compression boosting, which is a simpler and faster alternative to optimal compression boosting and implicit compression boosting used in previous FM-indexes. We also describe several new techniques for implementing fixed-block boosting efficiently, including a new, fast, and space-efficient implementation of wavelet trees. Our extensive experiments show the new indexes to be consistently fast and small relative to the state-of-the-art, and thus they make a good "off-the-shelf" choice for many applications. [ABSTRACT FROM AUTHOR]
- Subjects :
- *PATTERN matching
*THEORY-practice relationship
*DATA structures
Subjects
Details
- Language :
- English
- ISSN :
- 01784617
- Volume :
- 81
- Issue :
- 4
- Database :
- Academic Search Index
- Journal :
- Algorithmica
- Publication Type :
- Academic Journal
- Accession number :
- 135753099
- Full Text :
- https://doi.org/10.1007/s00453-018-0475-9