Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Structured Stochastic Gradient MCMC

Jul 19, 2021

Antonios Alexos, Alex Boyd, Stephan Mandt

Figure 1 for Structured Stochastic Gradient MCMC

Figure 2 for Structured Stochastic Gradient MCMC

Figure 3 for Structured Stochastic Gradient MCMC

Figure 4 for Structured Stochastic Gradient MCMC

Share this with someone who'll enjoy it:

Abstract:Stochastic gradient Markov chain Monte Carlo (SGMCMC) is considered the gold standard for Bayesian inference in large-scale models, such as Bayesian neural networks. Since practitioners face speed versus accuracy tradeoffs in these models, variational inference (VI) is often the preferable option. Unfortunately, VI makes strong assumptions on both the factorization and functional form of the posterior. In this work, we propose a new non-parametric variational approximation that makes no assumptions about the approximate posterior's functional form and allows practitioners to specify the exact dependencies the algorithm should respect or break. The approach relies on a new Langevin-type algorithm that operates on a modified energy function, where parts of the latent variables are averaged over samples from earlier iterations of the Markov chain. This way, statistical dependencies can be broken in a controlled way, allowing the chain to mix faster. This scheme can be further modified in a ''dropout'' manner, leading to even more scalability. By implementing the scheme on a ResNet-20 architecture, we obtain better predictive likelihoods and larger effective sample sizes than full SGMCMC.

View paper on

OpenReview

Share this with someone who'll enjoy it:

Title:Structured Stochastic Gradient MCMC

Paper and Code