Back to Search Start Over

Synthetic minority over-sampling technique based on fuzzy c-means clustering for imbalanced data

Authors :
Sungshin Kim
Min-Seok Kim
Jung Seunghyan
Hansoo Lee
Source :
2017 International Conference on Fuzzy Theory and Its Applications (iFUZZY).
Publication Year :
2017
Publisher :
IEEE, 2017.

Abstract

Some solutions to solve the class imbalance problem that is one of the representative difficulties in machine learning have been proposed. Among them, SMOTE algorithm is proposed recently to reduce the influence of the problem and shows its remarkable performance to solve real world problems. This paper proposes a novel kind of the SMOTE algorithm by combining the previous SMOTE algorithm and fuzzy logic to deal with uncertainties underlying learning samples. Also, fuzzy c-means clustering is used to assign membership degree to given samples efficiently. Moreover, the extended gap statistics is applied to select the optimal number of cluster. The proposed algorithm is evaluated on using several benchmark datasets and shows its good performance by combining support vector machine classifier.

Details

Database :
OpenAIRE
Journal :
2017 International Conference on Fuzzy Theory and Its Applications (iFUZZY)
Accession number :
edsair.doi...........f978b5769b70d5161e79a6d3746c0889
Full Text :
https://doi.org/10.1109/ifuzzy.2017.8311793