Back to Search Start Over

Single pass kernel k-means clustering method.

Authors :
SARMA, T
VISWANATH, P
REDDY, B
Source :
Sādhanā: Academy Proceedings in Engineering Sciences; Jun2013, Vol. 38 Issue 3, p407-419, 13p
Publication Year :
2013

Abstract

In unsupervised classification, kernel k-means clustering method has been shown to perform better than conventional k-means clustering method in identifying non-isotropic clusters in a data set. The space and time requirements of this method are O( n), where n is the data set size. Because of this quadratic time complexity, the kernel k-means method is not applicable to work with large data sets. The paper proposes a simple and faster version of the kernel k-means clustering method, called single pass kernel k -means clustering method. The proposed method works as follows. First, a random sample $\mathcal{S}$ is selected from the data set $\mathcal{D}$. A partition $\Pi_{\mathcal{S}}$ is obtained by applying the conventional kernel k-means method on the random sample $\mathcal{S}$. The novelty of the paper is, for each cluster in $\Pi_{\mathcal{S}}$, the exact cluster center in the input space is obtained using the gradient descent approach. Finally, each unsampled pattern is assigned to its closest exact cluster center to get a partition of the entire data set. The proposed method needs to scan the data set only once and it is much faster than the conventional kernel k-means method. The time complexity of this method is O( s + t + nk) where s is the size of the random sample $\mathcal{S}$, k is the number of clusters required, and t is the time taken by the gradient descent method (to find exact cluster centers). The space complexity of the method is O( s). The proposed method can be easily implemented and is suitable for large data sets, like those in data mining applications. Experimental results show that, with a small loss of quality, the proposed method can significantly reduce the time taken than the conventional kernel k-means clustering method. The proposed method is also compared with other recent similar methods. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
02562499
Volume :
38
Issue :
3
Database :
Complementary Index
Journal :
Sādhanā: Academy Proceedings in Engineering Sciences
Publication Type :
Academic Journal
Accession number :
89518020
Full Text :
https://doi.org/10.1007/s12046-013-0143-3