Back to Search Start Over

An Efficient Algorithm for Initializing Centroids in K-means Clustering

Authors :
Dr. Ahmed Hussain Aliwy
Dr. Kadhim B. S. Aljanabi
Source :
Journal of Kufa for Mathematics and Computer, Vol 3, Iss 2 (2016)
Publication Year :
2016
Publisher :
Faculty of Computer Science and Mathematics, University of Kufa, 2016.

Abstract

Clustering represents one of the most popular knowledge extraction algorithms in data mining techniques. Hierarchical and partitioning approaches are widely used in this field. Each has its own advantages, drawbacks and goals. K-means represents the most popular partitioning clusteringtechnique, however it suffers from two major drawbacks; time complexity and its sensitivity to the initial centroid values. The work in this paper presents an approach for estimating the starting initial centroids throughout three process including density based, normalization and smoothing ideas. The proposed algorithm has a strong mathematical foundation. The proposed approach was tested using a free standard data (20000 records). The results showed that the approach has better complexity and ensures the clustering convergence.

Details

Language :
English
ISSN :
20761171 and 25180010
Volume :
3
Issue :
2
Database :
Directory of Open Access Journals
Journal :
Journal of Kufa for Mathematics and Computer
Publication Type :
Academic Journal
Accession number :
edsdoj.fa1a8f31a5d438ea18f2c15ea9f1949
Document Type :
article
Full Text :
https://doi.org/10.31642/JoKMC/2018/030203