Back to Search Start Over

Clustering-Based Histograms for Multi-dimensional Data.

Authors :
Tjoa, A. Min
Trujillo, Juan
Furfaro, Filippo
Mazzeo, Giuseppe M.
Sirangelo, Cristina
Source :
Data Warehousing & Knowledge Discovery; 2005, p478-487, 10p
Publication Year :
2005

Abstract

A new technique for constructing multi-dimensional histograms is proposed. This technique first invokes a density-based clustering algorithm to locate dense and sparse regions of the input data. Then the data distribution inside each of these regions is summarized by partitioning it into non-overlapping blocks laid onto a grid. The granularity of this grid is chosen depending on the underlying data distribution: the more homogeneous the data, the coarser the grid. Our approach is compared with state-of-the-art histograms on both synthetic and real-life data and is shown to be more effective. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540285588
Database :
Supplemental Index
Journal :
Data Warehousing & Knowledge Discovery
Publication Type :
Book
Accession number :
32890981
Full Text :
https://doi.org/10.1007/11546849_47