Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Apr 15, 2020

Alvaro Ovalle, Simon M. Lucas

Figure 1 for Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Figure 2 for Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Figure 3 for Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Figure 4 for Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Share this with someone who'll enjoy it:

Abstract:Having access to a forward model enables the use of planning algorithms such as Monte Carlo Tree Search and Rolling Horizon Evolution. Where a model is unavailable, a natural aim is to learn a model that reflects accurately the dynamics of the environment. In many situations it might not be possible and minimal glitches in the model may lead to poor performance and failure. This paper explores the problem of model misspecification through uncertainty-aware reinforcement learning agents. We propose a bootstrapped multi-headed neural network that learns the distribution of future states and rewards. We experiment with a number of schemes to extract the most likely predictions. Moreover, we also introduce a global error correction filter that applies high-level constraints guided by the context provided through the predictive distribution. We illustrate our approach on Minipacman. The evaluation demonstrates that when dealing with imperfect models, our methods exhibit increased performance and stability, both in terms of model accuracy and in its use within a planning algorithm.

View paper on

Share this with someone who'll enjoy it:

Title:Bootstrapped model learning and error correction for planning with uncertainty in model-based RL

Paper and Code