Back to Search Start Over

Imputation-Based Ensemble Techniques for Class Imbalance Learning.

Authors :
Razavi-Far, Roozbeh
Farajzadeh-Zanajni, Maryam
Wang, Boyu
Saif, Mehrdad
Chakrabarti, Shiladitya
Source :
IEEE Transactions on Knowledge & Data Engineering. May2021, Vol. 33 Issue 5, p1988-2001. 14p.
Publication Year :
2021

Abstract

Correct classification of rare samples is a vital data mining task and of paramount importance in many research domains. This article mainly focuses on the development of the novel class-imbalance learning techniques, which make use of oversampling methods integrated with bagging and boosting ensembles. Two novel oversampling strategies based on the single and the multiple imputation methods are proposed. The proposed techniques aim to create useful synthetic minority class samples, similar to the original minority class samples, by estimation of missing values that are already induced in the minority class samples. The re-balanced datasets are then used to train base-learners of the ensemble algorithms. In addition, the proposed techniques are compared with the commonly used class imbalance learning methods in terms of three performance metrics including AUC, F-measure, and G-mean over several synthetic binary class datasets. The empirical results show that the proposed multiple imputation-based oversampling combined with bagging significantly outperforms other competitors. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
10414347
Volume :
33
Issue :
5
Database :
Academic Search Index
Journal :
IEEE Transactions on Knowledge & Data Engineering
Publication Type :
Academic Journal
Accession number :
149773612
Full Text :
https://doi.org/10.1109/TKDE.2019.2951556