An Improved K-Means Algorithm Based on Contour Similarity.

Authors :
Zhao, Jing
Bao, Yanke
Li, Dongsheng
Guan, Xinguo
Source :
Mathematics (2227-7390). Jul2024, Vol. 12 Issue 14, p2211. 16p.
Publication Year :
2024

Abstract

The traditional k-means algorithm is widely used for large-scale data clustering because it is easy to implement and efficient, but it is prone to local optima and has poor robustness. This study proposes Csk-means, a k-means algorithm based on contour similarity, to overcome these drawbacks. Traditional k-means tends to converge to local optima for two reasons: the influence of outliers or noisy data, and the random selection of the initial cluster centers. The Csk-means algorithm addresses both by combining data lattice transformation with dissimilar interpolation. To determine the number of clusters, it applies Fisher optimal partitioning to the similarity vectors between samples. To improve the robustness of k-means to cluster shape, it uses contour similarity to compute the similarity between samples during clustering. Experimental results show that the Csk-means algorithm produces better clustering results than the traditional k-means algorithm and the other algorithms compared. [ABSTRACT FROM AUTHOR]
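
The local-optimality problem named in the abstract can be illustrated with a minimal sketch of traditional k-means (Lloyd's iteration). This is not the Csk-means method and is not taken from the paper; the function names (`sq_dist`, `mean`, `kmeans`) and the four-point data set are assumptions chosen for illustration. The same data are clustered twice with k = 2, changing only the initial centers.

```python
def sq_dist(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def mean(cluster):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(cluster)
    return tuple(sum(coords) / n for coords in zip(*cluster))


def kmeans(points, init_centers, max_iter=100):
    """Traditional k-means from explicit initial centers (illustrative only).

    Returns the final centers and the sum of squared errors (SSE).
    """
    centers = [tuple(c) for c in init_centers]
    k = len(centers)
    for _ in range(max_iter):
        # Assignment step: attach each point to its nearest center.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: sq_dist(p, centers[j]))
            clusters[nearest].append(p)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its previous center).
        new_centers = [mean(c) if c else centers[j]
                       for j, c in enumerate(clusters)]
        if new_centers == centers:
            break  # no center moved, so the iteration has converged
        centers = new_centers
    sse = sum(min(sq_dist(p, c) for c in centers) for p in points)
    return centers, sse


# Hypothetical data: corners of a wide rectangle, two natural groups (left, right).
points = [(0, 0), (0, 1), (4, 0), (4, 1)]

_, sse_good = kmeans(points, [(0, 0), (4, 0)])  # one initial center per side
_, sse_bad = kmeans(points, [(0, 0), (0, 1)])   # both initial centers on the left

print(sse_good, sse_bad)  # 1.0 16.0
```

With both initial centers on the left side, the iteration converges to a top/bottom split whose SSE is sixteen times that of the left/right split, and no further update improves it. This is the kind of initialization-dependent local optimum the abstract attributes to random selection of the initial cluster centers.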

Details

Language :
English
ISSN :
22277390
Volume :
12
Issue :
14
Database :
Academic Search Index
Journal :
Mathematics (2227-7390)
Publication Type :
Academic Journal
Accession number :
178699847
Full Text :
https://doi.org/10.3390/math12142211