Back to Search Start Over

Model-agnostic feature importance and effects with dependent features: a conditional subgroup approach.

Authors :
Molnar, Christoph
König, Gunnar
Bischl, Bernd
Casalicchio, Giuseppe
Source :
Data Mining & Knowledge Discovery; Sep2024, Vol. 38 Issue 5, p2903-2941, 39p
Publication Year :
2024

Abstract

The interpretation of feature importance in machine learning models is challenging when features are dependent. Permutation feature importance (PFI) ignores such dependencies, which can cause misleading interpretations due to extrapolation. A possible remedy is more advanced conditional PFI approaches that enable the assessment of feature importance conditional on all other features. Due to this shift in perspective and in order to enable correct interpretations, it is beneficial if the conditioning is transparent and comprehensible. In this paper, we propose a new sampling mechanism for the conditional distribution based on permutations in conditional subgroups. As these subgroups are constructed using tree-based methods such as transformation trees, the conditioning becomes inherently interpretable. This not only provides a simple and effective estimator of conditional PFI, but also local PFI estimates within the subgroups. In addition, we apply the conditional subgroups approach to partial dependence plots, a popular method for describing feature effects that can also suffer from extrapolation when features are dependent and interactions are present in the model. In simulations and a real-world application, we demonstrate the advantages of the conditional subgroup approach over existing methods: It allows to compute conditional PFI that is more true to the data than existing proposals and enables a fine-grained interpretation of feature effects and importance within the conditional subgroups. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
13845810
Volume :
38
Issue :
5
Database :
Complementary Index
Journal :
Data Mining & Knowledge Discovery
Publication Type :
Academic Journal
Accession number :
179357246
Full Text :
https://doi.org/10.1007/s10618-022-00901-9