Back to Search
Start Over
A clustering ensemble: Two-level-refined co-association matrix with path-based transformation
- Source :
- Pattern Recognition. 48:2699-2709
- Publication Year :
- 2015
- Publisher :
- Elsevier BV, 2015.
-
Abstract
- The aim of clustering ensemble is to combine multiple base partitions into a robust, stable and accurate partition. One of the key problems of clustering ensemble is how to exploit the cluster structure information in each base partition. Evidence accumulation is an effective framework which can convert the base partitions into a co-association matrix. This matrix describes the frequency of a pair of points partitioned into the same cluster, but ignores some hidden information in the base partitions. In this paper, we reveal some of those information by refining the co-association matrix from data point and base cluster level. From the data point level, as pairs of points in the same base cluster may have varied similarities, their contributions to the co-association matrix can be different. From the cluster level, since the base clusters may have diversified qualities, the contribution of a base cluster as a whole can also be different from those of others. After being refined, the co-association matrix is transformed into a path-based similarity matrix so that more global information of the cluster structure is incorporated into the matrix. Finally, spectral clustering is applied to the matrix to generate the final clustering result. Experimental results on 8 synthetic and 8 real data sets demonstrate that the clustering ensemble based on the refined co-association matrix outperforms some state-of-the-art clustering ensemble schemes. HighlightsA two-level-refined co-association matrix for cluster ensemble is proposed.The refined co-association matrix is transformed by path-based measure.A theoretical background of the refinement is given.The proposed method outperforms some state-of-the-art ensemble methods.
- Subjects :
- Fuzzy clustering
business.industry
Correlation clustering
Single-linkage clustering
Pattern recognition
Spectral clustering
Biclustering
ComputingMethodologies_PATTERNRECOGNITION
Distance matrix
Artificial Intelligence
Signal Processing
Computer Vision and Pattern Recognition
Artificial intelligence
Cluster analysis
business
Software
k-medians clustering
Mathematics
Subjects
Details
- ISSN :
- 00313203
- Volume :
- 48
- Database :
- OpenAIRE
- Journal :
- Pattern Recognition
- Accession number :
- edsair.doi...........89dfd1600e0060b5497de2dfa1f12f8a