Back to Search Start Over

An inverse classification framework with limited budget and maximum number of perturbed samples.

Authors :
Koo, Jaehoon
Klabjan, Diego
Utke, Jean
Source :
Expert Systems with Applications. Feb2023, Vol. 212, pN.PAG-N.PAG. 1p.
Publication Year :
2023

Abstract

Most recent machine learning research focuses on developing new classifiers for the sake of improving classification accuracy. With many well-performing state-of-the-art classifiers available, there is a growing need for understanding interpretability of a classifier necessitated by practical purposes such as to find the best diet recommendation for a diabetes patient. Inverse classification is a post modeling process to find changes in input features of samples to alter the initially predicted class. It is useful in many business applications to determine how to adjust a sample input data such that the classifier predicts it to be in a desired class. In real world applications, a budget on perturbations of samples corresponding to customers or patients is usually considered, and in this setting, the number of successfully perturbed samples is key to increase benefits. In this study, we propose a new framework to solve inverse classification that maximizes the number of perturbed samples subject to a per-feature-budget limits and favorable classification classes of the perturbed samples. We design algorithms to solve this optimization problem based on gradient methods, stochastic processes, Lagrangian relaxations, and the Gumbel trick. In experiments, we find that our algorithms based on stochastic processes exhibit an excellent performance in different budget settings and they scale well. The relative improvement of the proposed stochastic algorithms over an existing method with a traditional formulation is 15% in the real-world dataset and 21% in two public datasets on average. • A new framework to solve inverse classification is developed. • It obtains successfully perturbed samples within a per-feature budget. • Algorithms are designed based on a gradient method and the Gumbel trick. • Real data from an industrial partner and two healthcare public datasets are used. • The stochastic approaches perform great on various budget scenarios, and they scale. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09574174
Volume :
212
Database :
Academic Search Index
Journal :
Expert Systems with Applications
Publication Type :
Academic Journal
Accession number :
159981742
Full Text :
https://doi.org/10.1016/j.eswa.2022.118761