Back to Search
Start Over
Synthetic minority over-sampling technique based on fuzzy c-means clustering for imbalanced data
- Source :
- 2017 International Conference on Fuzzy Theory and Its Applications (iFUZZY).
- Publication Year :
- 2017
- Publisher :
- IEEE, 2017.
-
Abstract
- Some solutions to solve the class imbalance problem that is one of the representative difficulties in machine learning have been proposed. Among them, SMOTE algorithm is proposed recently to reduce the influence of the problem and shows its remarkable performance to solve real world problems. This paper proposes a novel kind of the SMOTE algorithm by combining the previous SMOTE algorithm and fuzzy logic to deal with uncertainties underlying learning samples. Also, fuzzy c-means clustering is used to assign membership degree to given samples efficiently. Moreover, the extended gap statistics is applied to select the optimal number of cluster. The proposed algorithm is evaluated on using several benchmark datasets and shows its good performance by combining support vector machine classifier.
- Subjects :
- Degree (graph theory)
Computer science
02 engineering and technology
computer.software_genre
Imbalanced data
Fuzzy logic
Class imbalance
020204 information systems
Support vector machine classifier
0202 electrical engineering, electronic engineering, information engineering
Benchmark (computing)
Oversampling
020201 artificial intelligence & image processing
Data mining
Cluster analysis
computer
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2017 International Conference on Fuzzy Theory and Its Applications (iFUZZY)
- Accession number :
- edsair.doi...........f978b5769b70d5161e79a6d3746c0889
- Full Text :
- https://doi.org/10.1109/ifuzzy.2017.8311793