A new approach to generate diversified clusters for small data sets.
- Source :
- Applied Soft Computing; Oct2020, Vol. 95, pN.PAG-N.PAG, 1p
- Publication Year :
- 2020
Abstract
- Clustering is a common data mining technique whose main principle is that the samples within a cluster are similar to one another and dissimilar to those in other clusters. In other words, samples in the same cluster are highly homogeneous, while different clusters are highly heterogeneous. However, a user may instead require diversified clustering. In contrast to traditional clustering methods, the aim of diversified clustering is to make samples within the same cluster highly heterogeneous and to make different clusters highly homogeneous. Diversified clustering has practical applications in daily life, such as normal class grouping, student grouping in learning, cluster sampling, balanced diets and assignment of jobs. Nevertheless, our survey of related papers in the field of data mining found no prior research on diversified clustering. In this paper, we formally define the problem of diversified clustering and propose a new method to solve it. Experimental results showed that our method can generate good diversified clusterings. However, our method is currently appropriate only for small data sets, since its execution time increases quickly as the number of diversified clusters increases. We also hope this paper will attract interest in further research on effective methods to generate diversified clusters for use in data mining.
• This paper addresses a new problem, "diversified clustering".
• This paper proposes a novel approach to generate diversified clusters.
• Experimental results showed that Ripple can generate better diversified clusters.
[ABSTRACT FROM AUTHOR]
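The objective described in the abstract (high heterogeneity inside each cluster, high homogeneity across clusters) can be illustrated with a small sketch. This is not the method proposed in the paper, whose details are not given in this record. It assumes a simple round-robin heuristic, and the function names `mean_within_distance` and `round_robin_groups` as well as the sample scores are hypothetical, chosen only for illustration.

```python
from itertools import combinations
from math import dist


def mean_within_distance(points, labels):
    """Average pairwise Euclidean distance between samples that share a label."""
    groups = {}
    for point, label in zip(points, labels):
        groups.setdefault(label, []).append(point)
    pair_dists = [
        dist(a, b)
        for members in groups.values()
        for a, b in combinations(members, 2)
    ]
    return sum(pair_dists) / len(pair_dists)


def round_robin_groups(points, k):
    """Assumed heuristic (not the paper's method): sort the samples,
    then deal them out in turn to k groups so each group spans the range."""
    order = sorted(range(len(points)), key=lambda i: points[i])
    labels = [0] * len(points)
    for rank, i in enumerate(order):
        labels[i] = rank % k
    return labels


# Hypothetical one-dimensional data: nine student test scores.
scores = [(52,), (55,), (58,), (71,), (74,), (77,), (90,), (93,), (96,)]

# Diversified grouping: each group mixes low, middle and high scores.
diversified = round_robin_groups(scores, 3)

# Conventional grouping for comparison: similar scores share a group.
conventional = [0, 0, 0, 1, 1, 1, 2, 2, 2]

print(mean_within_distance(scores, diversified))
print(mean_within_distance(scores, conventional))
```

On this toy data the diversified grouping should show a much larger average within-group distance than the conventional one, which is the property the abstract asks for. The sketch handles only data that can be sorted along one ordering and says nothing about the running-time behaviour reported for the paper's method.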
- Subjects :
- CLUSTER sampling
DATA mining
ABILITY grouping (Education)
Details
- Language :
- English
- ISSN :
- 15684946
- Volume :
- 95
- Database :
- Supplemental Index
- Journal :
- Applied Soft Computing
- Publication Type :
- Academic Journal
- Accession number :
- 146147716
- Full Text :
- https://doi.org/10.1016/j.asoc.2020.106564