Back to Search Start Over

Index selection for databases: a hardness study and a principled heuristic solution

Authors :
Chaudhuri, Surajit
Datar, Mayur
Narasayya, Vivek
Source :
IEEE Transactions on Knowledge and Data Engineering. Nov, 2004, Vol. 16 Issue 11, p1313, 11 p.
Publication Year :
2004

Abstract

We study the index selection problem: Given a workload consisting of SQL statements on a database, and a user-specified storage constraint, recommend a set of indexes that have the maximum benefit for the given workload. We present a formal statement for this problem and show that it is computationally 'hard' to solve or even approximate it. We develop a new algorithm for the problem which is based on treating the problem as a knapsack problem. The novelty of our approach lies in an LP (linear programming) based method that assigns benefits to individual indexes. For a slightly modified algorithm, that does more work, we prove that we can give instance specific guarantees about the quality of our solution. We conduct an extensive experimental evaluation of this new heuristic and compare it with previous solutions. Our results demonstrate that our solution is more scalable while achieving comparable quality. Index Terms--Index selection, approximation, hardness result, NP-hardness, knapsack, linear programming, scalability.

Details

Language :
English
ISSN :
10414347
Volume :
16
Issue :
11
Database :
Gale General OneFile
Journal :
IEEE Transactions on Knowledge and Data Engineering
Publication Type :
Academic Journal
Accession number :
edsgcl.124261523