Back to Search Start Over

Simple Random Access Compression.

Authors :
Fredriksson, Kimmo
Nikitin, Fedor
Source :
Fundamenta Informaticae. 2009, Vol. 92 Issue 1-2, p63-81. 19p. 2 Charts, 1 Graph.
Publication Year :
2009

Abstract

Given a sequence S of n symbols over some alphabet Σ of size σ, we develop new compression methods that are (i) very simple to implement; (ii) provide O(1) time random access to any symbol (or short substring) of the original sequence. Our simplest solution uses at most 2h+o(h) bits of space, where h = n(H_{0}(S)+1), and H_{0}(S) is the zeroth-order empirical entropy of S. We discuss a number of improvements and trade-offs over the basic method. For example, we can achieve n(H_{k}(S)+1)+o(n(H_{k}(S)+1)) bits of space, for k = o(log_{σ}(n)). Several applications are discussed, including text compression, (compressed) full-text indexing and string matching. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
01692968
Volume :
92
Issue :
1-2
Database :
Academic Search Index
Journal :
Fundamenta Informaticae
Publication Type :
Academic Journal
Accession number :
40212812
Full Text :
https://doi.org/10.3233/FI-2009-0066