Title: Learning to Learn with Generative Models of Neural Network Checkpoints

Sep 26, 2022

William Peebles, Ilija Radosavovic, Tim Brooks, Alexei A. Efros, Jitendra Malik

Abstract: We explore a data-driven approach for learning to optimize neural networks. We construct a dataset of neural network checkpoints and train a generative model on the parameters. In particular, our model is a conditional diffusion transformer that, given an initial input parameter vector and a prompted loss, error, or return, predicts the distribution over parameter updates that achieve the desired metric. At test time, it can optimize neural networks with unseen parameters for downstream tasks in just one update. We find that our approach successfully generates parameters for a wide range of loss prompts. Moreover, it can sample multimodal parameter solutions and has favorable scaling properties. We apply our method to different neural network architectures and tasks in supervised and reinforcement learning.

* Code available at https://www.github.com/wpeebles/G.pt and project page and videos available at https://www.wpeebles.com/Gpt
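
To make the checkpoint dataset described in the abstract concrete, the following is a minimal sketch, not the authors' code. It assumes a toy linear-regression task trained with plain gradient descent, and every name in it (`Checkpoint`, `train_run`, `sample_pair`, the dictionary keys) is hypothetical. It records a checkpoint after each step of several training runs, then pairs two checkpoints from the same run into the kind of example a loss-conditioned generative model could be trained on: starting parameters, a prompted loss, and the parameter update that reaches it. The generative model itself is omitted.

```python
import random
from dataclasses import dataclass


@dataclass
class Checkpoint:
    params: list   # flattened parameter vector [w, b]
    loss: float    # loss recorded when the checkpoint was saved


def loss_fn(params, data):
    """Mean squared error of the toy model y = w * x + b."""
    w, b = params
    return sum((w * x + b - y) ** 2 for x, y in data) / len(data)


def train_run(data, steps=50, lr=0.05, seed=0):
    """Run gradient descent and save a checkpoint at every step."""
    rng = random.Random(seed)
    params = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    run = [Checkpoint(list(params), loss_fn(params, data))]
    for _ in range(steps):
        w, b = params
        grad_w = sum(2 * (w * x + b - y) * x for x, y in data) / len(data)
        grad_b = sum(2 * (w * x + b - y) for x, y in data) / len(data)
        params = [w - lr * grad_w, b - lr * grad_b]
        run.append(Checkpoint(list(params), loss_fn(params, data)))
    return run


def sample_pair(run, rng):
    """Pair two distinct checkpoints from one run into a training example.

    Assumption: the model is conditioned on the starting parameters and a
    prompted loss, and learns the update that reaches that loss.
    """
    start, target = rng.sample(run, 2)
    return {
        "start_params": start.params,
        "start_loss": start.loss,
        "prompt_loss": target.loss,
        "target_update": [t - s for s, t in zip(start.params, target.params)],
    }


# Toy data drawn from y = 2x + 1 on x in [-1, 1].
data = [(k / 10, 2.0 * (k / 10) + 1.0) for k in range(-10, 11)]
runs = [train_run(data, seed=s) for s in range(4)]

rng = random.Random(0)
examples = [sample_pair(rng.choice(runs), rng) for _ in range(8)]
```

Because the two checkpoints are drawn in arbitrary order, the prompted loss can be higher or lower than the starting loss, which is one simple way to cover a range of loss prompts.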

View paper on OpenReview
