Back to Search
Start Over
Ensemble model that minimizes the misclassification cost for imbalanced class credit data and its explanation using LIME.
- Source :
- AIP Conference Proceedings; 6/24/2022, Vol. 2465 Issue 1, p1-11, 11p
- Publication Year :
- 2022
-
Abstract
- Recently ensemble models have been proposed for credit scoring applications to alleviate the negative impact of imbalanced class data that can cause deterioration in prediction performance. Ensemble model have shown promising results in terms of model accuracy in previous studies. However, rather than model accuracy, the ultimate interest in credit scoring is to minimize the financial losses that can be more precisely quantified using total misclassification cost (TMC). Despite TMC is thought to be an effective indicator to assess the financial losses in credit scoring, unfortunately, TMC has yet been adequately studied for imbalanced credit data. This paper investigates the performance of several representative ensemble models over the imbalanced data using TMC as a model assessment tool. In addition, due to the fact that the bank is highly conservative in using new credit scoring models, we employ the LIME technique to interpret the best-selected ensemble model and provide some understanding of the predicted results. The finding shows that the ensemble model, particularly the bagging model, can provide a minimum TMC and higher discriminality power for imbalanced class data compared to the benchmark models (i.e. single classifier). Furthermore, LIME offers a reasonable interpretation of the prediction based on higher R<superscript>2</superscript> value produced. [ABSTRACT FROM AUTHOR]
- Subjects :
- CREDIT ratings
ECONOMIC indicators
COST
EXPLANATION
Subjects
Details
- Language :
- English
- ISSN :
- 0094243X
- Volume :
- 2465
- Issue :
- 1
- Database :
- Complementary Index
- Journal :
- AIP Conference Proceedings
- Publication Type :
- Conference
- Accession number :
- 157629695
- Full Text :
- https://doi.org/10.1063/5.0078751