Granit Luzhnica

Acting upon Imagination: when to trust imagined trajectories in model based reinforcement learning

May 13, 2021

Adrian Remonda, Eduardo Veas, Granit Luzhnica

Abstract: Model-based reinforcement learning (MBRL) uses an imperfect model of the world to imagine trajectories of future states and plan the best actions to maximize a reward function. These trajectories are imperfect, and MBRL attempts to overcome this by relying on model predictive control (MPC) to continuously re-imagine trajectories from scratch. Such regeneration of imagined trajectories carries a major computational cost, and the complexity increases in tasks with a longer receding horizon. This paper investigates how far into the future the imagined trajectories can be relied upon while still maintaining acceptable reward. Firstly, an error analysis is presented for systematically skipping recalculations for a varying number of consecutive steps. Secondly, we propose two methods for deciding when to trust and act upon imagined trajectories: looking at recent errors with respect to expectations, or comparing the confidence in an imagined action against its execution. Thirdly, we evaluate the effects of acting upon imagination while training the model of the world. Results show that acting upon imagination can reduce calculations by at least 20% and up to 80%, depending on the environment, while retaining acceptable reward.

Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data

Apr 22, 2021

Adrian Remonda, Sarah Krebs, Eduardo Veas, Granit Luzhnica, Roman Kern

Abstract: This paper explores the use of reinforcement learning (RL) models for autonomous racing. In contrast to passenger cars, where safety is the top priority, a racing car aims to minimize lap time. We frame the problem as a reinforcement learning task with a multidimensional input consisting of the vehicle telemetry, and a continuous action space. To find out which RL methods better solve the problem and whether the obtained models generalize to driving on unknown tracks, we put 10 variants of deep deterministic policy gradient (DDPG) to race in two experiments: i) studying how RL methods learn to drive a racing car and ii) studying how the learning scenario influences the capability of the models to generalize. Our studies show that models trained with RL are not only able to drive faster than the baseline open-source handcrafted bots but also generalize to unknown tracks.

* IJCAI 2019 - Workshop on Scaling-Up Reinforcement Learning (SURL) - Macau, China
