Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Feb 25, 2018

Vitchyr Pong, Shixiang Gu, Murtaza Dalal, Sergey Levine

Figure 1 for Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Figure 2 for Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Figure 3 for Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Figure 4 for Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Share this with someone who'll enjoy it:

Abstract:Model-free reinforcement learning (RL) is a powerful, general tool for learning complex behaviors. However, its sample efficiency is often impractically large for solving challenging real-world problems, even with off-policy algorithms such as Q-learning. A limiting factor in classic model-free RL is that the learning signal consists only of scalar rewards, ignoring much of the rich information contained in state transition tuples. Model-based RL uses this information, by training a predictive model, but often does not achieve the same asymptotic performance as model-free RL due to model bias. We introduce temporal difference models (TDMs), a family of goal-conditioned value functions that can be trained with model-free learning and used for model-based control. TDMs combine the benefits of model-free and model-based RL: they leverage the rich information in state transitions to learn very efficiently, while still attaining asymptotic performance that exceeds that of direct model-based RL methods. Our experimental results show that, on a range of continuous control tasks, TDMs provide a substantial improvement in efficiency compared to state-of-the-art model-based and model-free methods.

* To appear in ICLR 2018

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Temporal Difference Models: Model-Free Deep RL for Model-Based Control

Paper and Code