Interpreting Model Predictions with Constrained Perturbation and Counterfactual Instances.

Authors :: Fang, Jun-Peng
Zhou, Jun
Cui, Qing
Tang, Cai-Zhi
Li, Long-Fei
Source :: International Journal of Pattern Recognition & Artificial Intelligence; Jan2022, Vol. 36 Issue 1, p1-23, 23p
Publication Year :: 2022
Abstract: In recent years, machine learning models have achieved magnificent success in many industrial applications, but most of them are black boxes. It is crucial to understand why such predictions are made in many critical areas such as medicine, financial markets, and auto driving. In this paper, we propose Coco, a novel interpretation method which can interpret any binary classifier by assigning each feature an importance value for a particular prediction. We first adopt MixUp method to generate reasonable perturbations, then apply these perturbations with constraints to obtain counterfactual instances and finally compute a comprehensive metric on these instances to estimate the importance of each feature. To demonstrate the effectiveness of Coco, we conduct extensive experiments on several datasets. The results show our method achieves better performance in identifying the most important features compared with the state-of-the-art interpretation methods, including Shap and Lime. [ABSTRACT FROM AUTHOR]

Subjects :: COUNTERFACTUALS (Logic)
PREDICTION models
MACHINE learning
FINANCIAL markets

Language :: English
ISSN :: 02180014
Volume :: 36
Issue :: 1
Database :: Complementary Index
Journal :: International Journal of Pattern Recognition & Artificial Intelligence
Publication Type :: Academic Journal
Accession number :: 155550212
Full Text :: https://doi.org/10.1142/S0218001422510016

Full Text Access

Tools