
Data-Driven Nonzero-Sum Game for Discrete-Time Systems Using Off-Policy Reinforcement Learning

Authors :
Yongliang Yang
Sen Zhang
Jie Dong
Yixin Yin
Source :
IEEE Access, Vol 8, Pp 14074-14088 (2020)
Publication Year :
2020
Publisher :
IEEE, 2020.

Abstract

In this paper, we develop a data-driven algorithm to learn the Nash equilibrium solution of a two-player non-zero-sum (NZS) game with completely unknown linear discrete-time dynamics, based on off-policy reinforcement learning (RL). The algorithm solves the coupled algebraic Riccati equations (CARE) forward in time in a model-free manner using online measured data. We first derive the CARE for the two-player NZS game. Then, a model-free off-policy RL method is developed to obviate the requirement of complete knowledge of the system dynamics. In addition, the on-policy and off-policy RL algorithms are compared in terms of robustness against probing noise. Finally, a simulation example is presented to demonstrate the efficacy of the proposed approach.
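To illustrate the model-based problem the paper starts from, the following sketch solves the coupled algebraic Riccati equations of a two-player NZS game by Lyapunov-style policy iterations, assuming the dynamics (A, B1, B2) are known. This is not the paper's model-free off-policy scheme; all matrices and weights below are illustrative assumptions (cross-control weights are set to zero for simplicity).

```python
import numpy as np

# Illustrative two-player linear discrete-time NZS game:
#   x_{k+1} = A x_k + B1 u1_k + B2 u2_k,  J_i = sum_k x'Q_i x + u_i' R_ii u_i
A  = np.array([[0.9, 0.1],
               [0.0, 0.8]])          # Schur-stable open-loop dynamics (assumed)
B1 = np.array([[1.0], [0.0]])        # player 1 input channel
B2 = np.array([[0.0], [1.0]])        # player 2 input channel
Q1 = Q2 = np.eye(2)                  # state weights
R11 = R22 = np.array([[1.0]])        # own-control weights

def dlyap(Ac, Q, iters=500):
    """Fixed-point iteration for the discrete Lyapunov eq. P = Ac' P Ac + Q."""
    P = np.zeros_like(Q)
    for _ in range(iters):
        P = Ac.T @ P @ Ac + Q
    return P

# Gauss-Seidel best-response iteration on the feedback gains K1, K2.
K1 = np.zeros((1, 2))
K2 = np.zeros((1, 2))
for _ in range(200):
    Ac = A - B1 @ K1 - B2 @ K2       # closed loop under current policies
    P1 = dlyap(Ac, Q1 + K1.T @ R11 @ K1)   # policy evaluation, player 1
    P2 = dlyap(Ac, Q2 + K2.T @ R22 @ K2)   # policy evaluation, player 2
    # Each player best-responds to the other's current gain:
    K1 = np.linalg.solve(R11 + B1.T @ P1 @ B1, B1.T @ P1 @ (A - B2 @ K2))
    K2 = np.linalg.solve(R22 + B2.T @ P2 @ B2, B2.T @ P2 @ (A - B1 @ K1))

# At a Nash equilibrium each gain is a fixed point of its own best response.
Ac = A - B1 @ K1 - B2 @ K2
P1 = dlyap(Ac, Q1 + K1.T @ R11 @ K1)
P2 = dlyap(Ac, Q2 + K2.T @ R22 @ K2)
br1 = np.linalg.solve(R11 + B1.T @ P1 @ B1, B1.T @ P1 @ (A - B2 @ K2))
print("gain fixed-point deviation:", np.max(np.abs(br1 - K1)))
```

The paper's off-policy formulation replaces the explicit use of (A, B1, B2) in the evaluation and improvement steps above with quantities identified from measured state and input trajectories.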

Details

Language :
English
ISSN :
2169-3536
Volume :
8
Database :
Directory of Open Access Journals
Journal :
IEEE Access
Publication Type :
Academic Journal
Accession number :
edsdoj.80ea185393bd4d70ab7e3d29fb112364
Document Type :
article
Full Text :
https://doi.org/10.1109/ACCESS.2019.2960064