Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

Nov 04, 2017

Richard Liaw, Sanjay Krishnan, Animesh Garg, Daniel Crankshaw, Joseph E. Gonzalez, Ken Goldberg

Figure 1 for Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

Figure 2 for Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

Figure 3 for Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

Figure 4 for Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

Share this with someone who'll enjoy it:

Abstract:Rather than learning new control policies for each new task, it is possible, when tasks share some structure, to compose a "meta-policy" from previously learned policies. This paper reports results from experiments using Deep Reinforcement Learning on a continuous-state, discrete-action autonomous driving simulator. We explore how Deep Neural Networks can represent meta-policies that switch among a set of previously learned policies, specifically in settings where the dynamics of a new scenario are composed of a mixture of previously learned dynamics and where the state observation is possibly corrupted by sensing noise. We also report the results of experiments varying dynamics mixes, distractor policies, magnitudes/distributions of sensing noise, and obstacles. In a fully observed experiment, the meta-policy learning algorithm achieves 2.6x the reward achieved by the next best policy composition technique with 80% less exploration. In a partially observed experiment, the meta-policy learning algorithm converges after 50 iterations while a direct application of RL fails to converge even after 200 iterations.

* 8 pages, 11 figures

View paper on

Share this with someone who'll enjoy it:

Title:Composing Meta-Policies for Autonomous Driving Using Hierarchical Deep Reinforcement Learning

Paper and Code