Back to Search
Start Over
A New Majority Weighted Minority Oversampling Technique for Classification of Imbalanced Datasets
- Source :
- 2020 International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE).
- Publication Year :
- 2020
- Publisher :
- IEEE, 2020.
-
Abstract
- Classification problem is one of the essential tasks in data mining. Traditional classification strategies are predominantly via cost-insensitive equilibrium data. They tend to be concentrated on the overall accuracy of a model, and such classifiers are improper for unbalanced sample data. Hence, optimizing unbalanced samples to improve classifier performance is an issue worthy of discussion. Based on the information-rich minority samples that are difficult to learn, Majority Weighted Minority Oversampling Technique (MWMOTE) uses the clustering method to generate synthetic samples from the weighted information samples. However, the accuracy of the clustering should be optimized. To this end, a method called NC_Link_MWMOTE is presented for efficiently handling imbalanced learning problems. We propose a solution by using NC_Link-based hierarchical clustering method to synthesize different samples from a small number of samples, thus optimizing the clustering effect. NC_Link_MWMOTE was evaluated on six different levels of equilibrium data sets. The simulation results show that our method is effective and outperforms competitive baseline method in terms of various assessment metrics, such as Fl-score and Area Under Curve (AUC).
- Subjects :
- business.industry
Computer science
Pattern recognition
02 engineering and technology
Hierarchical clustering
Data set
Statistical classification
ComputingMethodologies_PATTERNRECOGNITION
020204 information systems
0202 electrical engineering, electronic engineering, information engineering
Oversampling
020201 artificial intelligence & image processing
Artificial intelligence
Cluster analysis
business
Classifier (UML)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- 2020 International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE)
- Accession number :
- edsair.doi...........5a7aafa315b3b4672af470983ec52c9c