Back to Search Start Over

Utility-Based Reinforcement Learning: Unifying Single-objective and Multi-objective Reinforcement Learning

Authors :
Vamplew, Peter
Foale, Cameron
Hayes, Conor F.
Mannion, Patrick
Howley, Enda
Dazeley, Richard
Johnson, Scott
Källström, Johan
Ramos, Gabriel
Rădulescu, Roxana
Röpke, Willem
Roijers, Diederik M.
Publication Year :
2024

Abstract

Research in multi-objective reinforcement learning (MORL) has introduced the utility-based paradigm, which makes use of both environmental rewards and a function that defines the utility derived by the user from those rewards. In this paper we extend this paradigm to the context of single-objective reinforcement learning (RL), and outline multiple potential benefits including the ability to perform multi-policy learning across tasks relating to uncertain objectives, risk-aware RL, discounting, and safe RL. We also examine the algorithmic implications of adopting a utility-based approach.<br />Comment: Accepted for the Blue Sky Track at AAMAS'24

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2402.02665
Document Type :
Working Paper