Interpreting Model Predictions with Constrained Perturbation and Counterfactual Instances.
- Source :
- International Journal of Pattern Recognition & Artificial Intelligence; Jan 2022, Vol. 36, Issue 1, pp. 1-23, 23 p.
- Publication Year :
- 2022
Abstract
- In recent years, machine learning models have achieved remarkable success in many industrial applications, but most of them are black boxes. In critical areas such as medicine, financial markets, and autonomous driving, it is crucial to understand why a model makes a given prediction. In this paper, we propose Coco, a novel interpretation method that can interpret any binary classifier by assigning each feature an importance value for a particular prediction. We first adopt the MixUp method to generate reasonable perturbations, then apply these perturbations under constraints to obtain counterfactual instances, and finally compute a comprehensive metric on these instances to estimate the importance of each feature. To demonstrate the effectiveness of Coco, we conduct extensive experiments on several datasets. The results show that our method identifies the most important features more accurately than state-of-the-art interpretation methods, including SHAP and LIME. [ABSTRACT FROM AUTHOR]
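The three steps named in the abstract (MixUp perturbation, constrained application, a metric over counterfactual instances) can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not state the constraints or the metric, so the code below assumes a "change one feature at a time" constraint and uses the fraction of label flips as the importance score. The names `mixup_perturbations`, `importance_scores`, `predict`, and `background` are hypothetical.

```python
import numpy as np

def mixup_perturbations(x, background, n_samples=200, alpha=0.5, rng=None):
    """MixUp-style blend of x with random background rows: lam*x + (1-lam)*b."""
    rng = np.random.default_rng(rng)
    idx = rng.integers(0, len(background), size=n_samples)
    lam = rng.beta(alpha, alpha, size=(n_samples, 1))
    return lam * x + (1.0 - lam) * background[idx]

def importance_scores(predict, x, background, n_samples=200, seed=0):
    """Score each feature by how often perturbing only that feature flips the label.

    Assumed constraint: one feature changes at a time (not taken from the paper).
    Assumed metric: fraction of perturbed instances that are counterfactual.
    """
    mixed = mixup_perturbations(x, background, n_samples, rng=seed)
    base = predict(x[None, :])[0]          # predicted class of the original instance
    scores = np.zeros(x.shape[0])
    for j in range(x.shape[0]):
        candidates = np.tile(x, (n_samples, 1))
        candidates[:, j] = mixed[:, j]     # only feature j takes the mixed value
        flipped = predict(candidates) != base
        scores[j] = flipped.mean()
    return scores

# Toy binary classifier that depends on feature 0 only (illustrative).
def toy_predict(X):
    return (X[:, 0] > 0.5).astype(int)

x = np.array([1.0, 1.0, 1.0])
background = np.zeros((50, 3))
scores = importance_scores(toy_predict, x, background)
```

In this toy setup only feature 0 can flip the prediction, so it should receive the only non-zero score; a real comparison against SHAP or LIME would need the paper's actual constraints and metric.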
- Subjects :
- COUNTERFACTUALS (Logic)
PREDICTION models
MACHINE learning
FINANCIAL markets
Details
- Language :
- English
- ISSN :
- 02180014
- Volume :
- 36
- Issue :
- 1
- Database :
- Complementary Index
- Journal :
- International Journal of Pattern Recognition & Artificial Intelligence
- Publication Type :
- Academic Journal
- Accession number :
- 155550212
- Full Text :
- https://doi.org/10.1142/S0218001422510016