Back to Search Start Over

Exploiting Homogeneity of Density in Incremental Hierarchical Clustering

Authors :
Dwi H. Widiyantoro
Source :
ITB Journal of Engineering Science, Vol 38, Iss 2, Pp 79-98 (2006)
Publication Year :
2006
Publisher :
ITB Journal Publisher, 2006.

Abstract

Hierarchical clustering is an important tool in many applications. As it involves a large data set that proliferates over time, reclustering the data set periodically is not an efficient process. Therefore, the ability to incorporate a new data set incrementally into an existing hierarchy becomes increasingly demanding. This article describes Homogen, a system that employs a new algorithm for generating a hierarchy of concepts and clusters incrementally from a stream of observations. The system aims to construct a hierarchy that satisfies the homogeneity and the monotonicity properties. Working in a bottom-up fashion, a new observation is placed in the hierarchy and a sequence of hierarchy restructuring processes is performed only in regions that have been affected by the presence of the new observation. Additionally, it combines multiple restructuring techniques that address different restructuring objectives to get a synergistic effect. The system has been tested on a variety of domains including structured and unstructured data sets. The experimental results reveal that the system is able to construct a concept hierarchy that is consistent regardless of the input data order and whose quality is comparable to the quality of those produced by non incremental clustering algorithms.

Details

Language :
English
ISSN :
19783051
Volume :
38
Issue :
2
Database :
Directory of Open Access Journals
Journal :
ITB Journal of Engineering Science
Publication Type :
Academic Journal
Accession number :
edsdoj.198fd37f6f2c48fe89fe78051f5eaabe
Document Type :
article