Daniel Simpson

Scalable Bayesian Learning with posteriors

May 31, 2024

Samuel Duffield, Kaelan Donatella, Johnathan Chiu, Phoebe Klett, Daniel Simpson

Abstract: Although theoretically compelling, Bayesian learning with modern machine learning models is computationally challenging since it requires approximating a high-dimensional posterior distribution. In this work, we (i) introduce posteriors, an easily extensible PyTorch library hosting general-purpose implementations making Bayesian learning accessible and scalable to large data and parameter regimes; (ii) present a tempered framing of stochastic gradient Markov chain Monte Carlo, as implemented in posteriors, that transitions seamlessly into optimization and unveils a minor modification to deep ensembles to ensure they are asymptotically unbiased for the Bayesian posterior; and (iii) demonstrate and compare the utility of Bayesian approximations through experiments including an investigation into the cold posterior effect and applications with large language models.
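To illustrate the tempered framing mentioned in the abstract, here is a minimal sketch of a Langevin-type update with a temperature parameter. It is not the posteriors API and not necessarily the authors' exact scheme: the function name `tempered_langevin`, the toy Gaussian target, the step size, and the use of a full (non-stochastic) gradient are all assumptions made for illustration. With temperature 0 the noise term vanishes and the update reduces to gradient ascent; with temperature 1 the iterates approximately sample the target.

```python
import math
import random


def tempered_langevin(grad_log_post, theta0, step_size, temperature, n_steps, rng):
    """Iterate theta <- theta + eps * grad + sqrt(2 * eps * T) * noise; return the path."""
    theta = theta0
    path = []
    noise_scale = math.sqrt(2.0 * step_size * temperature)
    for _ in range(n_steps):
        theta = theta + step_size * grad_log_post(theta) + noise_scale * rng.gauss(0.0, 1.0)
        path.append(theta)
    return path


# Hypothetical toy target: a Gaussian posterior N(2, 1),
# so the gradient of the log density is -(theta - 2).
def grad_log_post(theta):
    return -(theta - 2.0)


rng = random.Random(0)

# Temperature 0: no injected noise, so this is plain gradient ascent to the mode.
mode_path = tempered_langevin(grad_log_post, 0.0, 0.1, 0.0, 500, rng)
mode_estimate = mode_path[-1]

# Temperature 1: noisy updates whose iterates approximately follow N(2, 1).
sample_path = tempered_langevin(grad_log_post, 0.0, 0.1, 1.0, 50000, rng)
samples = sample_path[1000:]  # discard an initial burn-in segment
sample_mean = sum(samples) / len(samples)
sample_var = sum((s - sample_mean) ** 2 for s in samples) / len(samples)
```

In this toy setting the discretisation inflates the sampled variance slightly above 1; a smaller step size reduces that bias at the cost of slower mixing.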

Yes, but Did It Work?: Evaluating Variational Inference

Jul 07, 2018

Yuling Yao, Aki Vehtari, Daniel Simpson, Andrew Gelman

Abstract: While it's always possible to compute a variational approximation to a posterior distribution, it can be difficult to discover problems with this approximation. We propose two diagnostic algorithms to alleviate this problem. The Pareto-smoothed importance sampling (PSIS) diagnostic gives a goodness of fit measurement for joint distributions, while simultaneously improving the error in the estimate. The variational simulation-based calibration (VSBC) assesses the average performance of point estimates.

* Proceedings of the 35th International Conference on Machine Learning, PMLR 80:5581-5590, 2018. http://proceedings.mlr.press/v80/yao18a.html
* Appearing at International Conference on Machine Learning 2018

On Russian Roulette Estimates for Bayesian Inference with Doubly-Intractable Likelihoods

Dec 10, 2015

Anne-Marie Lyne, Mark Girolami, Yves Atchadé, Heiko Strathmann, Daniel Simpson

Abstract: A large number of statistical models are "doubly-intractable": the likelihood normalising term, which is a function of the model parameters, is intractable, as well as the marginal likelihood (model evidence). This means that standard inference techniques to sample from the posterior, such as Markov chain Monte Carlo (MCMC), cannot be used. Examples include, but are not confined to, massive Gaussian Markov random fields, autologistic models and exponential random graph models. A number of approximate schemes based on MCMC techniques, approximate Bayesian computation (ABC) or analytic approximations to the posterior have been suggested, and these are reviewed here. Exact MCMC schemes, which can be applied to a subset of doubly-intractable distributions, have also been developed and are described in this paper. As yet, no general method exists which can be applied to all classes of models with doubly-intractable posteriors. In addition, taking inspiration from the physics literature, we study an alternative method based on representing the intractable likelihood as an infinite series. Unbiased estimates of the likelihood can then be obtained by finite time stochastic truncation of the series via Russian Roulette sampling, although the estimates are not necessarily positive. Results from the Quantum Chromodynamics literature are exploited to allow the use of possibly negative estimates in a pseudo-marginal MCMC scheme such that expectations with respect to the posterior distribution are preserved. The methodology is reviewed on well-known examples such as the parameters in Ising models, the posterior for Fisher-Bingham distributions on the $d$-Sphere and a large-scale Gaussian Markov random field model describing the Ozone Column data. This leads to a critical assessment of the strengths and weaknesses of the methodology with pointers to ongoing research.

* Statistical Science 2015, Vol. 30, No. 4, 443-467
* Published at http://dx.doi.org/10.1214/15-STS523 in the Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org)
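The stochastic truncation idea in the abstract above can be sketched on a toy series. This is only an illustration of generic Russian Roulette truncation, not the paper's likelihood estimator: the function name `russian_roulette_estimate`, the geometric test series, and the fixed continuation probability are assumptions. Each further term is included with probability `continue_prob` and reweighted by the inverse of its inclusion probability, so the expected value of the finite sum equals the infinite sum.

```python
import random


def russian_roulette_estimate(term, continue_prob, rng):
    """Estimate sum_{k >= 0} term(k) from a randomly truncated, reweighted partial sum."""
    total = term(0)  # the first term is always included
    weight = 1.0
    k = 0
    # Keep going with probability continue_prob; term k is reached with
    # probability continue_prob**k, so it is weighted by 1 / continue_prob**k.
    while rng.random() < continue_prob:
        k += 1
        weight /= continue_prob
        total += weight * term(k)
    return total


# Hypothetical test series: sum_{k >= 0} 0.5**k, whose exact value is 2.
def geometric_term(k):
    return 0.5 ** k


rng = random.Random(0)
estimates = [russian_roulette_estimate(geometric_term, 0.8, rng) for _ in range(20000)]
mean_estimate = sum(estimates) / len(estimates)
```

Every individual estimate uses finitely many terms, yet the average over many runs approaches the full sum. In this toy series all terms are positive; as the abstract notes, with signed terms the estimates need not be positive.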
