Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Scaling up budgeted reinforcement learning

Mar 06, 2019

Nicolas Carrara, Edouard Leurent, Romain Laroche, Tanguy Urvoy, Odalric-Ambrym Maillard, Olivier Pietquin

Figure 1 for Scaling up budgeted reinforcement learning

Figure 2 for Scaling up budgeted reinforcement learning

Figure 3 for Scaling up budgeted reinforcement learning

Figure 4 for Scaling up budgeted reinforcement learning

Share this with someone who'll enjoy it:

Abstract:Can we learn a control policy able to adapt its behaviour in real time so as to take any desired amount of risk? The general Reinforcement Learning framework solely aims at optimising a total reward in expectation, which may not be desirable in critical applications. In stark contrast, the Budgeted Markov Decision Process (BMDP) framework is a formalism in which the notion of risk is implemented as a hard constraint on a failure signal. Existing algorithms solving BMDPs rely on strong assumptions and have so far only been applied to toy-examples. In this work, we relax some of these assumptions and demonstrate the scalability of our approach on two practical problems: a spoken dialogue system and an autonomous driving task. On both examples, we reach similar performances as Lagrangian Relaxation methods with a significant improvement in sample and memory efficiency.

* N.Carrara and E.Leurent have equally contributed. The source code, videos and additional details for all experiments are available at https://scaling-up-brl.github.io

View paper on

Share this with someone who'll enjoy it:

Title:Scaling up budgeted reinforcement learning

Paper and Code