Back to Search Start Over

Explanation sets: A general framework for machine learning explainability.

Authors :
R. Fernández, Rubén
Martín de Diego, Isaac
M. Moguerza, Javier
Herrera, Francisco
Source :
Information Sciences. Dec2022, Vol. 617, p464-481. 18p.
Publication Year :
2022

Abstract

• Explanation Sets, a new framework that unifies counterfactuals and semifactuals. • Counterfactuals and semifactuals are defined in terms of a similarity measure. • Restrictions and preferences over the explanations are defined using a neighborhood. • A taxonomy for set-based representations. Explainable Machine Learning (ML) is an emerging field of Artificial Intelligence that has gained popularity in the last decade. It focuses on explaining ML models and their predictions, enabling people to understand the rationale behind them. Counterfactuals and semifactuals are two instances of Explainable ML techniques that explain model predictions using other observations. These techniques are based on the comparison between the observation to be explained and another one. In counterfactuals, their prediction is different, and in semifactuals, it is the same. Both techniques have been studied in the Social Sciences and Explainable ML communities, and they have different use cases and properties. In this paper, the Explanation Set framework, an approach that unifies counterfactuals and semifactuals, is introduced. Explanation Sets are example-based explanations defined in a neighborhood where most observations satisfy a grouping measure. The neighborhood allows defining and combining restrictions. The grouping measure determines if the explanations are counterfactuals (dissimilarity) or semifactuals (similarity). Besides providing a unified framework, the major strength of the proposal is to extend these explanations to other tasks such as regression by using an appropriate grouping measure. The proposal is validated in a regression and classification task using several neighborhoods and grouping measures. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
00200255
Volume :
617
Database :
Academic Search Index
Journal :
Information Sciences
Publication Type :
Periodical
Accession number :
161014303
Full Text :
https://doi.org/10.1016/j.ins.2022.10.084