Back to Search Start Over

A Framework for Parallelizing Hierarchical Clustering Methods

Authors :
Benjamin Moseley
Kefu Lu
Thomas Lavastida
Silvio Lattanzi
Source :
Machine Learning and Knowledge Discovery in Databases ISBN: 9783030461492, ECML/PKDD (1)
Publication Year :
2020
Publisher :
Springer International Publishing, 2020.

Abstract

Hierarchical clustering is a fundamental tool in data mining, machine learning and statistics. Popular hierarchical clustering algorithms include top-down divisive approaches such as bisecting k-means, k-median, and k-center and bottom-up agglomerative approaches such as single-linkage, average-linkage, and centroid-linkage. Unfortunately, only a few scalable hierarchical clustering algorithms are known, mostly based on the single-linkage algorithm. So, as datasets increase in size every day, there is a pressing need to scale other popular methods.

Details

ISBN :
978-3-030-46149-2
ISBNs :
9783030461492
Database :
OpenAIRE
Journal :
Machine Learning and Knowledge Discovery in Databases ISBN: 9783030461492, ECML/PKDD (1)
Accession number :
edsair.doi...........c11ef764bcfd489913a2a08141b2b45d