Back to Search Start Over

A new approach to generate diversified clusters for small data sets.

Authors :
Peng, Chun-Cheng
Tsai, Cheng-Jung
Chang, Ting-Yi
Yeh, Jen-Yuan
Hua, Po-Wei
Source :
Applied Soft Computing; Oct2020, Vol. 95, pN.PAG-N.PAG, 1p
Publication Year :
2020

Abstract

Clustering is a common data mining technique whose main principle states that the samples within a cluster are similar to one another and dissimilar to those in other clusters. This means that samples in the same cluster possess high homogeneity, while different clusters possess high heterogeneity. However, a user may require a result of diversified clustering. Compared to traditional clustering methods, the aim of diversified clustering is to make samples of the same cluster possess high heterogeneity, and different clusters possess high homogeneity. Diversified clustering can be practically applied to aspects of our daily lives such as normal class grouping, student grouping in learning, cluster sampling, balanced diets and assignment of jobs. Nevertheless, our survey of related papers in the research field of data mining found that there has been no proposed research for diversified clustering. In this paper, we formal define the problem of diversified clustering and propose a new method to solve this problem. Experimental results showed that our method can generate good diversified clustering. However, our method is currently only appropriate for small data sets since the execution time of our method increases quickly as the number of diversified clusters increases. We also hope this paper will garner interest in more research on effective methods to generate diversified clusters for use in data mining. • This paper addresses a new problem "diversified clustering". • This paper proposes a novel approach to generate diversified clusters. • Experimental results showed that Ripple can generate better diversified clusters. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
15684946
Volume :
95
Database :
Supplemental Index
Journal :
Applied Soft Computing
Publication Type :
Academic Journal
Accession number :
146147716
Full Text :
https://doi.org/10.1016/j.asoc.2020.106564