Back to Search
Start Over
Policy Optimization with Sparse Global Contrastive Explanations
- Publication Year :
- 2022
-
Abstract
- We develop a Reinforcement Learning (RL) framework for improving an existing behavior policy via sparse, user-interpretable changes. Our goal is to make minimal changes while gaining as much benefit as possible. We define a minimal change as having a sparse, global contrastive explanation between the original and proposed policy. We improve the current policy with the constraint of keeping that global contrastive explanation short. We demonstrate our framework with a discrete MDP and a continuous 2D navigation domain.<br />Comment: Accepted at IMLH Workshop, ICML 2022
- Subjects :
- Computer Science - Machine Learning
Subjects
Details
- Database :
- arXiv
- Publication Type :
- Report
- Accession number :
- edsarx.2207.06269
- Document Type :
- Working Paper