Back to Search Start Over

Temporal-Differential Learning in Continuous Environments

Authors :
Bian, Tao
Jiang, Zhong-Ping
Publication Year :
2020

Abstract

In this paper, a new reinforcement learning (RL) method known as the method of temporal differential is introduced. Compared to the traditional temporal-difference learning method, it plays a crucial role in developing novel RL techniques for continuous environments. In particular, the continuous-time least squares policy evaluation (CT-LSPE) and the continuous-time temporal-differential (CT-TD) learning methods are developed. Both theoretical and empirical evidences are provided to demonstrate the effectiveness of the proposed temporal-differential learning methodology.

Details

Database :
arXiv
Publication Type :
Report
Accession number :
edsarx.2006.00997
Document Type :
Working Paper