Hypernetworks are neural networks that generate the parameters of another neural network. In many scenarios, current hypernetwork training strategies are unstable, and convergence is often far slower than for non-hypernetwork models. We show that this problem is linked to a numerical issue that arises under common choices of hypernetwork architecture and initialization. We demonstrate analytically and experimentally how this issue can lead to an instability during training that slows, and sometimes even prevents, convergence. We further show that popular deep learning normalization strategies fail to address the problem. We then propose a solution based on a revised hypernetwork formulation that uses non-proportional additive parametrizations. We evaluate the proposed reparametrization on several tasks and demonstrate that it consistently leads to more stable training and faster convergence.
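To make the setting concrete, the following is a minimal sketch of a hypernetwork-generated linear layer. It is not the formulation proposed here: the class `HyperLinear`, the `additive` flag, and all dimensions are illustrative assumptions. The `additive=False` path uses the hypernetwork output directly as the target weights, while the `additive=True` path treats the output as an offset added to an independently initialized base weight, which is one plausible reading of an additive parametrization.

```python
# Illustrative sketch only; names and the additive scheme are assumptions,
# not the paper's exact method.
import torch
import torch.nn as nn


class HyperLinear(nn.Module):
    def __init__(self, cond_dim, in_features, out_features, additive=True):
        super().__init__()
        self.in_features = in_features
        self.out_features = out_features
        self.additive = additive
        # Hypernetwork: maps a conditioning input to a flat parameter vector
        # holding the target layer's weight matrix and bias.
        self.hyper = nn.Sequential(
            nn.Linear(cond_dim, 64),
            nn.ReLU(),
            nn.Linear(64, out_features * in_features + out_features),
        )
        if additive:
            # Base parameters initialized like a regular linear layer; the
            # hypernetwork only contributes an additive offset around them.
            base = nn.Linear(in_features, out_features)
            self.base_weight = nn.Parameter(base.weight.detach().clone())
            self.base_bias = nn.Parameter(base.bias.detach().clone())

    def forward(self, x, cond):
        params = self.hyper(cond)
        w, b = params.split([self.out_features * self.in_features, self.out_features])
        w = w.view(self.out_features, self.in_features)
        if self.additive:
            w = self.base_weight + w
            b = self.base_bias + b
        return nn.functional.linear(x, w, b)


# Usage: condition on a single task embedding and apply to a batch of inputs.
layer = HyperLinear(cond_dim=8, in_features=16, out_features=4)
x = torch.randn(32, 16)
cond = torch.randn(8)
out = layer(x, cond)
print(out.shape)  # torch.Size([32, 4])
```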