Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Jan 23, 2021

Brendan Tidd, Nicolas Hudson, Akansel Cosgun, Jurgen Leitner

Figure 1 for Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Figure 2 for Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Figure 3 for Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Figure 4 for Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Share this with someone who'll enjoy it:

Abstract:Dynamic platforms that operate over manyunique terrain conditions typically require multiple controllers.To transition safely between controllers, there must be anoverlap of states between adjacent controllers. We developa novel method for training Setup Policies that bridge thetrajectories between pre-trained Deep Reinforcement Learning(DRL) policies. We demonstrate our method with a simulatedbiped traversing a difficult jump terrain, where a single policyfails to learn the task, and switching between pre-trainedpolicies without Setup Policies also fails. We perform anablation of key components of our system, and show thatour method outperforms others that learn transition policies.We demonstrate our method with several difficult and diverseterrain types, and show that we can use Setup Policies as partof a modular control suite to successfully traverse a sequence ofcomplex terrains. We show that using Setup Policies improvesthe success rate for traversing a single difficult jump terrain(from 1.5%success rate without Setup Policies to 82%), and asequence of various terrains (from 6.5%without Setup Policiesto 29.1%).

* Submitted to Humanoids 2020

View paper on

Share this with someone who'll enjoy it:

Title:Learning Setup Policies: Reliable Transition Between Locomotion Behaviours

Paper and Code