Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Goal-Conditioned Generators of Deep Policies

Jul 04, 2022

Francesco Faccio, Vincent Herrmann, Aditya Ramesh, Louis Kirsch, Jürgen Schmidhuber

Figure 1 for Goal-Conditioned Generators of Deep Policies

Figure 2 for Goal-Conditioned Generators of Deep Policies

Figure 3 for Goal-Conditioned Generators of Deep Policies

Figure 4 for Goal-Conditioned Generators of Deep Policies

Share this with someone who'll enjoy it:

Abstract:Goal-conditioned Reinforcement Learning (RL) aims at learning optimal policies, given goals encoded in special command inputs. Here we study goal-conditioned neural nets (NNs) that learn to generate deep NN policies in form of context-specific weight matrices, similar to Fast Weight Programmers and other methods from the 1990s. Using context commands of the form "generate a policy that achieves a desired expected return," our NN generators combine powerful exploration of parameter space with generalization across commands to iteratively find better and better policies. A form of weight-sharing HyperNetworks and policy embeddings scales our method to generate deep NNs. Experiments show how a single learned policy generator can produce policies that achieve any return seen during training. Finally, we evaluate our algorithm on a set of continuous control tasks where it exhibits competitive performance. Our code is public.

* Preprint. Under Review

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Goal-Conditioned Generators of Deep Policies

Paper and Code