Back to Search Start Over

Stop using the elbow criterion for k-means and how to choose the number of clusters instead

Authors :
Schubert, Erich
Publication Year :
2022

Abstract

A major challenge when using k-means clustering often is how to choose the parameter k, the number of clusters. In this letter, we want to point out that it is very easy to draw poor conclusions from a common heuristic, the "elbow method". Better alternatives have been known in literature for a long time, and we want to draw attention to some of these easy to use options, that often perform better. This letter is a call to stop using the elbow method altogether, because it severely lacks theoretic support, and we want to encourage educators to discuss the problems of the method -- if introducing it in class at all -- and teach alternatives instead, while researchers and reviewers should reject conclusions drawn from the elbow method.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2212.12189
Document Type :
Working Paper
Full Text :
https://doi.org/10.1145/3606274.3606278