Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Peter Carbonetto

Gradient-based optimization for variational empirical Bayes multiple regression

Nov 21, 2024

Saikat Banerjee, Peter Carbonetto, Matthew Stephens

Abstract:Variational empirical Bayes (VEB) methods provide a practically attractive approach to fitting large, sparse, multiple regression models. These methods usually use coordinate ascent to optimize the variational objective function, an approach known as coordinate ascent variational inference (CAVI). Here we propose alternative optimization approaches based on gradient-based (quasi-Newton) methods, which we call gradient-based variational inference (GradVI). GradVI exploits a recent result from Kim et. al. [arXiv:2208.10910] which writes the VEB regression objective function as a penalized regression. Unfortunately the penalty function is not available in closed form, and we present and compare two approaches to dealing with this problem. In simple situations where CAVI performs well, we show that GradVI produces similar predictive performance, and GradVI converges in fewer iterations when the predictors are highly correlated. Furthermore, unlike CAVI, the key computations in GradVI are simple matrix-vector products, and so GradVI is much faster than CAVI in settings where the design matrix admits fast matrix-vector products (e.g., as we show here, trendfiltering applications) and lends itself to parallelized implementations in ways that CAVI does not. GradVI is also very flexible, and could exploit automatic differentiation to easily implement different prior families. Our methods are implemented in an open-source Python software, GradVI (available from https://github.com/stephenslab/gradvi ).

Via

Access Paper or Ask Questions

A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression

Aug 23, 2022

Youngseok Kim, Wei Wang, Peter Carbonetto, Matthew Stephens

Figure 1 for A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression

Figure 2 for A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression

Figure 3 for A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression

Figure 4 for A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression

Abstract:We introduce a new empirical Bayes approach for large-scale multiple linear regression. Our approach combines two key ideas: (i) the use of flexible "adaptive shrinkage" priors, which approximate the nonparametric family of scale mixture of normal distributions by a finite mixture of normal distributions; and (ii) the use of variational approximations to efficiently estimate prior hyperparameters and compute approximate posteriors. Combining these two ideas results in fast and flexible methods, with computational speed comparable to fast penalized regression methods such as the Lasso, and with superior prediction accuracy across a wide range of scenarios. Furthermore, we show that the posterior mean from our method can be interpreted as solving a penalized regression problem, with the precise form of the penalty function being learned from the data by directly solving an optimization problem (rather than being tuned by cross-validation). Our methods are implemented in an R package, mr.ash.alpha, available from https://github.com/stephenslab/mr.ash.alpha

Via

Access Paper or Ask Questions

Non-negative matrix factorization algorithms greatly improve topic model fits

May 27, 2021

Peter Carbonetto, Abhishek Sarkar, Zihao Wang, Matthew Stephens

Figure 1 for Non-negative matrix factorization algorithms greatly improve topic model fits

Figure 2 for Non-negative matrix factorization algorithms greatly improve topic model fits

Figure 3 for Non-negative matrix factorization algorithms greatly improve topic model fits

Figure 4 for Non-negative matrix factorization algorithms greatly improve topic model fits

Abstract:We report on the potential for using algorithms for non-negative matrix factorization (NMF) to improve parameter estimation in topic models. While several papers have studied connections between NMF and topic models, none have suggested leveraging these connections to develop new algorithms for fitting topic models. Importantly, NMF avoids the "sum-to-one" constraints on the topic model parameters, resulting in an optimization problem with simpler structure and more efficient computations. Building on recent advances in optimization algorithms for NMF, we show that first solving the NMF problem then recovering the topic model fit can produce remarkably better fits, and in less time, than standard algorithms for topic models. While we focus primarily on maximum likelihood estimation, we show that this approach also has the potential to improve variational inference for topic models. Our methods are implemented in the R package fastTopics.

* Submitted to Advances in Neural Information Processing Systems 2021

Via

Access Paper or Ask Questions

Nonparametric Bayesian Logic

Jul 04, 2012

Peter Carbonetto, Jacek Kisynski, Nando de Freitas, David L Poole

Figure 1 for Nonparametric Bayesian Logic

Figure 2 for Nonparametric Bayesian Logic

Figure 3 for Nonparametric Bayesian Logic

Figure 4 for Nonparametric Bayesian Logic

Abstract:The Bayesian Logic (BLOG) language was recently developed for defining first-order probability models over worlds with unknown numbers of objects. It handles important problems in AI, including data association and population estimation. This paper extends BLOG by adopting generative processes over function spaces - known as nonparametrics in the Bayesian literature. We introduce syntax for reasoning about arbitrary collections of objects, and their properties, in an intuitive manner. By exploiting exchangeability, distributions over unknown objects and their attributes are cast as Dirichlet processes, which resolve difficulties in model selection and inference caused by varying numbers of objects. We demonstrate these concepts with application to citation matching.

* Appears in Proceedings of the Twenty-First Conference on Uncertainty in Artificial Intelligence (UAI2005)

Via

Access Paper or Ask Questions