Back to Search
Start Over
An Efficient Algorithm for Initializing Centroids in K-means Clustering
- Source :
- Journal of Kufa for Mathematics and Computer, Vol 3, Iss 2 (2016)
- Publication Year :
- 2016
- Publisher :
- Faculty of Computer Science and Mathematics, University of Kufa, 2016.
-
Abstract
- Clustering represents one of the most popular knowledge extraction algorithms in data mining techniques. Hierarchical and partitioning approaches are widely used in this field. Each has its own advantages, drawbacks and goals. K-means represents the most popular partitioning clusteringtechnique, however it suffers from two major drawbacks; time complexity and its sensitivity to the initial centroid values. The work in this paper presents an approach for estimating the starting initial centroids throughout three process including density based, normalization and smoothing ideas. The proposed algorithm has a strong mathematical foundation. The proposed approach was tested using a free standard data (20000 records). The results showed that the approach has better complexity and ensures the clustering convergence.
- Subjects :
- Data Mining
Clustering
K-means
Centroids
Complexity
Mathematics
QA1-939
Subjects
Details
- Language :
- English
- ISSN :
- 20761171 and 25180010
- Volume :
- 3
- Issue :
- 2
- Database :
- Directory of Open Access Journals
- Journal :
- Journal of Kufa for Mathematics and Computer
- Publication Type :
- Academic Journal
- Accession number :
- edsdoj.fa1a8f31a5d438ea18f2c15ea9f1949
- Document Type :
- article
- Full Text :
- https://doi.org/10.31642/JoKMC/2018/030203