Title: Continual Model-Based Reinforcement Learning with Hypernetworks

Sep 25, 2020

Yizhou Huang, Kevin Xie, Homanga Bharadhwaj, Florian Shkurti

Abstract: Effective planning in model-based reinforcement learning (MBRL) and model-predictive control (MPC) relies on the accuracy of the learned dynamics model. In many instances of MBRL and MPC, this model is assumed to be stationary and is periodically re-trained from scratch on state transition experience collected from the beginning of environment interactions. This implies that the time required to train the dynamics model, and with it the pause required between plan executions, grows linearly with the size of the collected experience. We argue that this is too slow for lifelong robot learning and propose HyperCRL, a method that continually learns the encountered dynamics in a sequence of tasks using task-conditional hypernetworks. Our method has three main attributes: first, it enables constant-time dynamics learning sessions between planning and only needs to store the most recent fixed-size portion of the state transition experience; second, it uses fixed-capacity hypernetworks to represent non-stationary and task-aware dynamics; third, it outperforms existing continual learning alternatives that rely on fixed-capacity networks, and performs competitively with baselines that remember an ever-increasing coreset of past experience. We show that HyperCRL is effective in continual model-based reinforcement learning in robot locomotion and manipulation scenarios, such as tasks involving pushing and door opening. Our project website with code and videos is at http://rvl.cs.toronto.edu/blog/2020/hypercrl/

* 13 pages, 6 figures. Preliminary report, under review
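
The abstract's central idea, a task-conditional hypernetwork that produces the parameters of a dynamics model, can be illustrated with a small sketch. This is not the authors' implementation. It is a minimal, untrained example using only the Python standard library, and every name and size in it (`HyperNetwork`, `predict_next_state`, `TARGET_SHAPES`, the layer dimensions, the linear hypernetwork, and the choice to predict a state change rather than the next state directly) is an assumption made for illustration. Training, planning, and any mechanism for retaining earlier tasks are omitted.

```python
import math
import random

# Illustrative sizes only; these are assumptions, not values from the paper.
STATE_DIM, ACTION_DIM, HIDDEN_DIM, EMBED_DIM = 3, 1, 4, 2
IN_DIM = STATE_DIM + ACTION_DIM

# (rows, cols) of each weight matrix in the target dynamics network;
# each layer also has one bias per row.
TARGET_SHAPES = [(HIDDEN_DIM, IN_DIM), (STATE_DIM, HIDDEN_DIM)]
NUM_TARGET_PARAMS = sum(rows * cols + rows for rows, cols in TARGET_SHAPES)


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class HyperNetwork:
    """Fixed-capacity map from a per-task embedding to all dynamics-model parameters.

    A single linear layer is used here for brevity (an assumption).
    """

    def __init__(self, num_tasks, seed=0):
        rng = random.Random(seed)
        self.weights = [[rng.gauss(0.0, 0.1) for _ in range(EMBED_DIM)]
                        for _ in range(NUM_TARGET_PARAMS)]
        # One embedding vector per task; in practice these would be learned.
        self.task_embeddings = [[rng.gauss(0.0, 1.0) for _ in range(EMBED_DIM)]
                                for _ in range(num_tasks)]

    def generate(self, task_id):
        """Return the flat parameter vector of the dynamics model for one task."""
        return matvec(self.weights, self.task_embeddings[task_id])


def predict_next_state(flat_params, state, action):
    """Run the target dynamics network with hypernetwork-generated parameters."""
    activations = list(state) + list(action)
    offset = 0
    for layer_index, (rows, cols) in enumerate(TARGET_SHAPES):
        # Slice this layer's weight matrix and bias out of the flat vector.
        weight = [flat_params[offset + r * cols: offset + (r + 1) * cols]
                  for r in range(rows)]
        offset += rows * cols
        bias = flat_params[offset: offset + rows]
        offset += rows
        activations = [z + b for z, b in zip(matvec(weight, activations), bias)]
        if layer_index < len(TARGET_SHAPES) - 1:
            activations = [math.tanh(z) for z in activations]
    # Assumption: the network outputs a state change that is added to the state.
    return [s + d for s, d in zip(state, activations)]


hnet = HyperNetwork(num_tasks=2)
state, action = [0.0, 0.1, -0.2], [0.5]
for task_id in range(2):
    params = hnet.generate(task_id)
    print(task_id, predict_next_state(params, state, action))
```

In this sketch the hypernetwork's size does not depend on the number of tasks; only the list of small task embeddings grows, which is one way to read the "fixed-capacity" claim in the abstract.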
