Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tao Bian

Temporal-Differential Learning in Continuous Environments

Jun 01, 2020

Tao Bian, Zhong-Ping Jiang

Figure 1 for Temporal-Differential Learning in Continuous Environments

Figure 2 for Temporal-Differential Learning in Continuous Environments

Figure 3 for Temporal-Differential Learning in Continuous Environments

Figure 4 for Temporal-Differential Learning in Continuous Environments

Abstract:In this paper, a new reinforcement learning (RL) method known as the method of temporal differential is introduced. Compared to the traditional temporal-difference learning method, it plays a crucial role in developing novel RL techniques for continuous environments. In particular, the continuous-time least squares policy evaluation (CT-LSPE) and the continuous-time temporal-differential (CT-TD) learning methods are developed. Both theoretical and empirical evidences are provided to demonstrate the effectiveness of the proposed temporal-differential learning methodology.

Via

Access Paper or Ask Questions