Back to Search Start Over

A New Algorithm for Discriminative Clustering and Its Maximum Entropy Extension

Authors :
Xiao-Bin Zhi
Jiu-Lun Fan
Source :
Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques ISBN: 9783319238616, IScIDE (2)
Publication Year :
2015
Publisher :
Springer International Publishing, 2015.

Abstract

Discriminative clustering DC can effectively integrates subspace selection and clustering into a coherent framework. It performs in the iterative classical Linear Discriminant Analysis LDA dimensionality reduction and clustering processing. DC can effectively cluster the data with high dimension. However, it has complex form and high computational complexity. Recent work shows DC is equivalent to kernel k-means KM with a specific kernel matrix. This new insights provides a chance of simplifying the optimization problem in the original DC algorithm. Based on this equivalence relationship, Discriminative K-means DKM algorithm is proposed. When the number of data points denoted as n is small, DKM is feasible and efficient. However, the construction of kernel matrix needs to compute the inverse of a matrix in DKM, when n is large, which is time consuming. In this paper, we concentratei?źon the efficiency of DC. We present a new framework for DC, namely, Efficient DC EDC, which consists of DKM and the whitening transformation of the regularized total scatter matrix WRTS plus KM clustering WRTS+KM. When m dimensions is small and n far outweighs m, namely, ni?źi?źi?źm, EDC can carry out WRTS+KM on data, which is more efficient than DKM. When n is small and m far outweighs n, namely, mi?źi?źi?źn, EDC can carry out DKM on data, which is more efficient. We also extend EDC to soft case, and propose Efficient Discriminative Maximum Entropy Clustering EDMEC, which is an efficient version of maximum entropy based DC. Extensive experiments on a collection of benchmark data sets are presented to show the effectiveness of the proposed algorithms.

Details

ISBN :
978-3-319-23861-6
ISBNs :
9783319238616
Database :
OpenAIRE
Journal :
Intelligence Science and Big Data Engineering. Big Data and Machine Learning Techniques ISBN: 9783319238616, IScIDE (2)
Accession number :
edsair.doi...........795890bad9fc0e97506e17c6bbd2bc4e
Full Text :
https://doi.org/10.1007/978-3-319-23862-3_42