
Title: Variational Inference of overparameterized Bayesian Neural Networks: a theoretical and empirical study

Jul 08, 2022

Tom Huix, Szymon Majewski, Alain Durmus, Eric Moulines, Anna Korba


Abstract: This paper studies Variational Inference (VI) for training Bayesian Neural Networks (BNN) in the overparameterized regime, i.e., when the number of neurons tends to infinity. More specifically, we consider overparameterized two-layer BNN and point out a critical issue in mean-field VI training. This problem arises from the decomposition of the evidence lower bound (ELBO) into two terms: one corresponding to the likelihood function of the model and the other to the Kullback-Leibler (KL) divergence between the prior distribution and the variational posterior. In particular, we show both theoretically and empirically that there is a trade-off between these two terms in the overparameterized regime only when the KL is appropriately re-scaled with respect to the ratio between the number of observations and the number of neurons. We also illustrate our theoretical results with numerical experiments that highlight the critical choice of this ratio.
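The two-term decomposition described in the abstract can be made concrete with a small numerical sketch. The code below is not the authors' implementation: it assumes a standard normal prior, a diagonal Gaussian variational posterior, a tanh activation, a 1/N output scaling, and a Gaussian likelihood (with additive constants dropped). The names `gaussian_kl`, `tempered_neg_elbo`, and the KL weight `eta` are hypothetical. The abstract only says that the KL must be re-scaled according to the ratio between the number of observations and the number of neurons, so `eta` is left as a free parameter here rather than set to any particular formula.

```python
import numpy as np


def gaussian_kl(mu, sigma):
    """KL( N(mu, diag(sigma^2)) || N(0, I) ), summed over all weights."""
    return 0.5 * np.sum(sigma**2 + mu**2 - 1.0 - 2.0 * np.log(sigma))


def tempered_neg_elbo(x, y, mu, sigma, eta, rng, n_samples=8, noise_std=1.0):
    """Monte Carlo estimate of E_q[-log p(y | x, w)] + eta * KL(q || prior).

    mu, sigma: arrays of shape (n_neurons, d + 1); each row holds the d input
    weights and the single output weight of one hidden neuron (assumed layout).
    Returns (total, likelihood_term, kl_term).
    """
    n_neurons = mu.shape[0]
    nll = 0.0
    for _ in range(n_samples):
        # Reparameterized sample from the variational posterior.
        w = mu + sigma * rng.standard_normal(mu.shape)
        hidden = np.tanh(x @ w[:, :-1].T)       # shape (n_obs, n_neurons)
        pred = hidden @ w[:, -1] / n_neurons    # assumed 1/N output scaling
        # Gaussian negative log-likelihood, up to an additive constant.
        nll += 0.5 * np.sum((y - pred) ** 2) / noise_std**2
    nll /= n_samples
    kl = gaussian_kl(mu, sigma)
    return nll + eta * kl, nll, kl


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_obs, d = 50, 3
    x = rng.standard_normal((n_obs, d))
    y = np.sin(x[:, 0])
    for n_neurons in (10, 100, 1000):
        mu = 0.5 * np.ones((n_neurons, d + 1))
        sigma = 0.5 * np.ones((n_neurons, d + 1))
        total, nll, kl = tempered_neg_elbo(x, y, mu, sigma, eta=1.0, rng=rng)
        print(f"neurons={n_neurons:5d}  likelihood term={nll:10.2f}  KL term={kl:10.2f}")
```

With the per-weight variational parameters held fixed, the KL term in this sketch is a sum over all weights and therefore grows linearly with the number of neurons, while the likelihood term is a sum over the observations only. This is one way to see why the relative weight of the two terms depends on the ratio between observations and neurons, and why an unscaled KL can dominate the objective as the network widens.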
