Back to Search Start Over

On the hardness of finding optimal multiple preset dictionaries

Authors :
Mitzenmacher, Michael
Source :
IEEE Transactions on Information Theory. July, 2004, Vol. 50 Issue 7, p1536, 4 p.
Publication Year :
2004

Abstract

We show that the following simple compression problem is NP-hard: given a collection of documents, find the pair of Huffman dictionaries that minimizes the total compressed size of the collection, where the best dictionary from the pair is used to compress each document. We also show the NP-hardness of finding optimal multiple preset dictionaries for LZ'77-based compression schemes. Our reductions make use of the catalog segmentation problem, a natural partitioning problem. Our results justify heuristic attacks used in practice. Index Terms--Huffman coding, LZ'77, NP-completeness, preset dictionaries, two-stage compression.

Subjects

Subjects :
Information theory -- Research

Details

Language :
English
ISSN :
00189448
Volume :
50
Issue :
7
Database :
Gale General OneFile
Journal :
IEEE Transactions on Information Theory
Publication Type :
Academic Journal
Accession number :
edsgcl.118957190