Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Imitation Bootstrapped Reinforcement Learning

Nov 20, 2023

Hengyuan Hu, Suvir Mirchandani, Dorsa Sadigh

Figure 1 for Imitation Bootstrapped Reinforcement Learning

Figure 2 for Imitation Bootstrapped Reinforcement Learning

Figure 3 for Imitation Bootstrapped Reinforcement Learning

Figure 4 for Imitation Bootstrapped Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:Despite the considerable potential of reinforcement learning (RL), robotics control tasks predominantly rely on imitation learning (IL) owing to its better sample efficiency. However, given the high cost of collecting extensive demonstrations, RL is still appealing if it can utilize limited imitation data for efficient autonomous self-improvement. Existing RL methods that utilize demonstrations either initialize the replay buffer with demonstrations and oversample them during RL training, which does not benefit from the generalization potential of modern IL methods, or pretrain the RL policy with IL on the demonstrations, which requires additional mechanisms to prevent catastrophic forgetting during RL fine-tuning. We propose imitation bootstrapped reinforcement learning (IBRL), a novel framework that first trains an IL policy on a limited number of demonstrations and then uses it to propose alternative actions for both online exploration and target value bootstrapping. IBRL achieves SoTA performance and sample efficiency on 7 challenging sparse reward continuous control tasks in simulation while learning directly from pixels. As a highlight of our method, IBRL achieves $6.4\times$ higher success rate than RLPD, a strong method that combines the idea of oversampling demonstrations with modern RL improvements, under the budget of 10 demos and 100K interactions in the challenging PickPlaceCan task in the Robomimic benchmark.

View paper on

Share this with someone who'll enjoy it:

Title:Imitation Bootstrapped Reinforcement Learning

Paper and Code