Back to Search
Start Over
Comprehensive survey on hierarchical clustering algorithms and the recent developments.
- Source :
- Artificial Intelligence Review; Aug2023, Vol. 56 Issue 8, p8219-8264, 46p
- Publication Year :
- 2023
-
Abstract
- Data clustering is a commonly used data processing technique in many fields, which divides objects into different clusters in terms of some similarity measure between data points. Comparing to partitioning clustering methods which give a flat partition of the data, hierarchical clustering methods can give multiple consistent partitions of the data at different levels for the same data without rerunning clustering, it can be used to better analyze the complex structure of the data. There are usually two kinds of hierarchical clustering methods: divisive and agglomerative. For the divisive clustering, the key issue is how to select a cluster for the next splitting procedure according to dissimilarity and how to divide the selected cluster. For agglomerative hierarchical clustering, the key issue is the similarity measure that is used to select the two most similar clusters for the next merge. Although both types of the methods produce the dendrogram of the data as output, the clustering results may be very different depending on the dissimilarity or similarity measure used in the clustering, and different types of methods should be selected according to different types of the data and different application scenarios. So, we have reviewed various hierarchical clustering methods comprehensively, especially the most recently developed methods, in this work. The similarity measure plays a crucial role during hierarchical clustering process, we have reviewed different types of the similarity measure along with the hierarchical clustering. More specifically, different types of hierarchical clustering methods are comprehensively reviewed from six aspects, and their advantages and drawbacks are analyzed. The application of some methods in real life is also discussed. Furthermore, we have also included some recent works in combining deep learning techniques and hierarchical clustering, which is worth serious attention and may improve the hierarchical clustering significantly in the future. [ABSTRACT FROM AUTHOR]
Details
- Language :
- English
- ISSN :
- 02692821
- Volume :
- 56
- Issue :
- 8
- Database :
- Complementary Index
- Journal :
- Artificial Intelligence Review
- Publication Type :
- Academic Journal
- Accession number :
- 164580073
- Full Text :
- https://doi.org/10.1007/s10462-022-10366-3