Counterfactual explanation of Bayesian model uncertainty
- Author
- Tehseen Zia, Álvaro Rocha, Muhammad Ilyas, Feras N. Al-Obeidat, Abdallah Tubaishat, and Gohar Ali
- Subjects
- Counterfactual thinking, Counterfactual conditional, Artificial neural network, Computer science, Deep learning, Bayesian probability, Machine learning, Bayesian inference, Artificial intelligence, Softmax function, MNIST database
- Abstract
Artificial intelligence systems are becoming ubiquitous in everyday life as well as in high-risk environments such as autonomous driving and medical treatment. The opaque nature of deep neural networks raises concerns about their adoption in such high-risk settings, so it is important for researchers to explain how these models reach their decisions. Most existing methods rely on the softmax score to explain model decisions. However, the softmax score is often misleading, assigning unjustifiably high confidence even to samples far from the training data. To overcome this shortcoming, we propose using Bayesian model uncertainty to produce counterfactual explanations. In this paper, we compare counterfactual explanations based on Bayesian uncertainty with those based on the softmax score. The method identifies a minimal set of important features whose perturbation maximally changes the classifier output, thereby explaining the decision-making process of the Bayesian model. We used the MNIST and Caltech-UCSD Birds 2011 datasets for our experiments. The results show that the Bayesian model outperforms the softmax-based model and produces more concise and human-understandable counterfactuals. (A minimal code sketch of this procedure follows the record below.)
- Published
- 2021
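
The abstract does not include an implementation, but the two ingredients it describes can be sketched concretely: Bayesian model uncertainty is commonly approximated with Monte Carlo dropout (Gal & Ghahramani, 2016), and a counterfactual here is a sparse perturbation that maximally changes the classifier output. The sketch below is a hedged illustration in PyTorch, not the authors' code: the model architecture, the `l1_weight` sparsity penalty, the number of MC samples, and all hyperparameters are assumptions chosen for clarity.

```python
# Minimal sketch (assumed, not the paper's implementation) of:
#   (1) Bayesian uncertainty via Monte Carlo dropout, and
#   (2) a gradient-based search for a sparse counterfactual perturbation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class DropoutMLP(nn.Module):
    """Small MNIST-style classifier; dropout is kept active at test time."""
    def __init__(self, in_dim=784, hidden=256, n_classes=10, p=0.5):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, hidden)
        self.drop = nn.Dropout(p)
        self.fc2 = nn.Linear(hidden, n_classes)

    def forward(self, x):
        return self.fc2(self.drop(F.relu(self.fc1(x))))

def mc_predict(model, x, n_samples=30):
    """MC dropout: average the softmax over stochastic forward passes.

    Returns the predictive mean (class probabilities) and the per-class
    standard deviation as an uncertainty estimate.
    """
    model.train()  # keep dropout active (Bernoulli approximate posterior)
    with torch.no_grad():
        probs = torch.stack(
            [F.softmax(model(x), dim=-1) for _ in range(n_samples)]
        )
    return probs.mean(0), probs.std(0)

def counterfactual(model, x, target_class, steps=200, lr=0.1, l1_weight=0.05):
    """Find a sparse perturbation `delta` that pushes the MC-dropout
    predictive mean toward `target_class`; the L1 penalty keeps the set of
    changed features minimal, so the surviving pixels form the explanation."""
    delta = torch.zeros_like(x, requires_grad=True)
    opt = torch.optim.Adam([delta], lr=lr)
    model.train()  # dropout stays on so gradients reflect the Bayesian mean
    for _ in range(steps):
        # Average class probabilities over a few stochastic forward passes.
        probs = torch.stack(
            [F.softmax(model(x + delta), dim=-1) for _ in range(10)]
        ).mean(0)
        loss = (-torch.log(probs[:, target_class] + 1e-8).mean()
                + l1_weight * delta.abs().sum())
        opt.zero_grad()
        loss.backward()
        opt.step()
        delta.data.clamp_(-1.0, 1.0)  # keep the edit in a plausible range
    return delta.detach()
```

Running the same search with dropout disabled (`model.eval()`) and a single forward pass recovers the softmax-score baseline the paper compares against; the abstract's claim is that scoring the search with the MC-dropout predictive mean yields sparser, more human-understandable counterfactuals.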