Start Over

An inverse classification framework with limited budget and maximum number of perturbed samples.

Authors :: Koo, Jaehoon
Klabjan, Diego
Utke, Jean
Source :: Expert Systems with Applications. Feb2023, Vol. 212, pN.PAG-N.PAG. 1p.
Publication Year :: 2023
Abstract: Most recent machine learning research focuses on developing new classifiers for the sake of improving classification accuracy. With many well-performing state-of-the-art classifiers available, there is a growing need for understanding interpretability of a classifier necessitated by practical purposes such as to find the best diet recommendation for a diabetes patient. Inverse classification is a post modeling process to find changes in input features of samples to alter the initially predicted class. It is useful in many business applications to determine how to adjust a sample input data such that the classifier predicts it to be in a desired class. In real world applications, a budget on perturbations of samples corresponding to customers or patients is usually considered, and in this setting, the number of successfully perturbed samples is key to increase benefits. In this study, we propose a new framework to solve inverse classification that maximizes the number of perturbed samples subject to a per-feature-budget limits and favorable classification classes of the perturbed samples. We design algorithms to solve this optimization problem based on gradient methods, stochastic processes, Lagrangian relaxations, and the Gumbel trick. In experiments, we find that our algorithms based on stochastic processes exhibit an excellent performance in different budget settings and they scale well. The relative improvement of the proposed stochastic algorithms over an existing method with a traditional formulation is 15% in the real-world dataset and 21% in two public datasets on average. • A new framework to solve inverse classification is developed. • It obtains successfully perturbed samples within a per-feature budget. • Algorithms are designed based on a gradient method and the Gumbel trick. • Real data from an industrial partner and two healthcare public datasets are used. • The stochastic approaches perform great on various budget scenarios, and they scale. [ABSTRACT FROM AUTHOR]

Subjects :: *BUDGET
*STOCHASTIC processes
*MACHINE learning
*CLASSIFICATION
*PEOPLE with diabetes

Details

Language :: English
ISSN :: 09574174
Volume :: 212
Database :: Academic Search Index
Journal :: Expert Systems with Applications
Publication Type :: Academic Journal
Accession number :: 159981742
Full Text :: https://doi.org/10.1016/j.eswa.2022.118761

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

An inverse classification framework with limited budget and maximum number of perturbed samples.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

An inverse classification framework with limited budget and maximum number of perturbed samples.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources