Back to Search Start Over

The Implementation of Minimum Forest Graph for Centroid Updating Process on K-Means Algorithm

Authors :
Wahyu Catur Wibowo
Achmad Maududie
Source :
INFORMAL: Informatics Journal. 3:67
Publication Year :
2019
Publisher :
UPT Penerbitan Universitas Jember, 2019.

Abstract

K-Means is a well known algorithms of clusteing. It generates some groups based on degree of similarity. Simplicity of implementation, ease of interpretation, adaptability to sparse data, linear complexity, speed of convergence, and versatile in almost every aspect are noble characteristics of this algorithm. However, this algorithm is very sensitive on defining initial centroids process. Giving a bad initial centroid always produces a bad quality output. Due to this weakness, it is recommended to make some runs with different initial centroids and select the initial centroid that produces cluster with minimum error. However, this procedure is hard to achieve a satisfying result. This paper introduces a new approach to minimize the initial centroid problem of K-Means algorithm. This approach focus on centroid updating stage in K-Means algorithm by applying minimum forest graph to produce better new centroids. Based on gain information and Dunn index values, this approach provided a better result than Forgy method when this approach tested on both well distributed and noisy dataset. Moreover, from the experiments with two dimentional data, the proposed approach produced consisten members of each cluster in every run, where it could not be found in Forgy method.

Details

ISSN :
2503250X
Volume :
3
Database :
OpenAIRE
Journal :
INFORMAL: Informatics Journal
Accession number :
edsair.doi...........673d36a3853087e1d645cc019c34ce91
Full Text :
https://doi.org/10.19184/isj.v3i3.10239