Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Nick Polson

Generative Bayesian Computation for Maximum Expected Utility

Aug 28, 2024

Nick Polson, Fabrizio Ruggeri, Vadim Sokolov

Figure 1 for Generative Bayesian Computation for Maximum Expected Utility

Figure 2 for Generative Bayesian Computation for Maximum Expected Utility

Abstract:Generative Bayesian Computation (GBC) methods are developed to provide an efficient computational solution for maximum expected utility (MEU). We propose a density-free generative method based on quantiles that naturally calculates expected utility as a marginal of quantiles. Our approach uses a deep quantile neural estimator to directly estimate distributional utilities. Generative methods assume only the ability to simulate from the model and parameters and as such are likelihood-free. A large training dataset is generated from parameters and output together with a base distribution. Our method a number of computational advantages primarily being density-free with an efficient estimator of expected utility. A link with the dual theory of expected utility and risk taking is also discussed. To illustrate our methodology, we solve an optimal portfolio allocation problem with Bayesian learning and a power utility (a.k.a. fractional Kelly criterion). Finally, we conclude with directions for future research.

Via

Access Paper or Ask Questions

Deep Learning: A Tutorial

Oct 10, 2023

Nick Polson, Vadim Sokolov

Abstract:Our goal is to provide a review of deep learning methods which provide insight into structured high-dimensional data. Rather than using shallow additive architectures common to most statistical models, deep learning uses layers of semi-affine input transformations to provide a predictive rule. Applying these layers of transformations leads to a set of attributes (or, features) to which probabilistic statistical methods can be applied. Thus, the best of both worlds can be achieved: scalable prediction rules fortified with uncertainty quantification, where sparse regularization finds the features.

* arXiv admin note: text overlap with arXiv:1808.08618

Via

Access Paper or Ask Questions

Quantum Bayes AI

Aug 17, 2022

Nick Polson, Vadim Sokolov, Jianeng Xu

Abstract:Quantum Bayesian AI (Q-B) is an emerging field that levers the computational gains available in Quantum computing. The promise is an exponential speed-up in many Bayesian algorithms. Our goal is to apply these methods directly to statistical and machine learning problems. We provide a duality between classical and quantum probability for calculating of posterior quantities of interest. Our framework unifies MCMC, Deep Learning and Quantum Learning calculations from the viewpoint from von Neumann's principle of quantum measurement. Quantum embeddings and neural gates are also an important part of data encoding and feature selection. There is a natural duality with well-known kernel methods in statistical learning. We illustrate the behaviour of quantum algorithms on two simple classification algorithms. Finally, we conclude with directions for future research.

Via

Access Paper or Ask Questions

Merging Two Cultures: Deep and Statistical Learning

Oct 22, 2021

Anindya Bhadra, Jyotishka Datta, Nick Polson, Vadim Sokolov, Jianeng Xu

Figure 1 for Merging Two Cultures: Deep and Statistical Learning

Figure 2 for Merging Two Cultures: Deep and Statistical Learning

Figure 3 for Merging Two Cultures: Deep and Statistical Learning

Figure 4 for Merging Two Cultures: Deep and Statistical Learning

Abstract:Merging the two cultures of deep and statistical learning provides insights into structured high-dimensional data. Traditional statistical modeling is still a dominant strategy for structured tabular data. Deep learning can be viewed through the lens of generalized linear models (GLMs) with composite link functions. Sufficient dimensionality reduction (SDR) and sparsity performs nonlinear feature engineering. We show that prediction, interpolation and uncertainty quantification can be achieved using probabilistic methods at the output layer of the model. Thus a general framework for machine learning arises that first generates nonlinear features (a.k.a factors) via sparse regularization and stochastic gradient optimisation and second uses a stochastic output layer for predictive uncertainty. Rather than using shallow additive architectures as in many statistical models, deep learning uses layers of semi affine input transformations to provide a predictive rule. Applying these layers of transformations leads to a set of attributes (a.k.a features) to which predictive statistical methods can be applied. Thus we achieve the best of both worlds: scalability and fast predictive rule construction together with uncertainty quantification. Sparse regularisation with un-supervised or supervised learning finds the features. We clarify the duality between shallow and wide models such as PCA, PPR, RRR and deep but skinny architectures such as autoencoders, MLPs, CNN, and LSTM. The connection with data transformations is of practical importance for finding good network architectures. By incorporating probabilistic components at the output level we allow for predictive uncertainty. For interpolation we use deep Gaussian process and ReLU trees for classification. We provide applications to regression, classification and interpolation. Finally, we conclude with directions for future research.

* arXiv admin note: text overlap with arXiv:2106.14085

Via

Access Paper or Ask Questions

Chess AI: Competing Paradigms for Machine Intelligence

Sep 23, 2021

Shiva Maharaj, Nick Polson, Alex Turk

Figure 1 for Chess AI: Competing Paradigms for Machine Intelligence

Figure 2 for Chess AI: Competing Paradigms for Machine Intelligence

Figure 3 for Chess AI: Competing Paradigms for Machine Intelligence

Figure 4 for Chess AI: Competing Paradigms for Machine Intelligence

Abstract:Endgame studies have long served as a tool for testing human creativity and intelligence. We find that they can serve as a tool for testing machine ability as well. Two of the leading chess engines, Stockfish and Leela Chess Zero (LCZero), employ significantly different methods during play. We use Plaskett's Puzzle, a famous endgame study from the late 1970s, to compare the two engines. Our experiments show that Stockfish outperforms LCZero on the puzzle. We examine the algorithmic differences between the engines and use our observations as a basis for carefully interpreting the test results. Drawing inspiration from how humans solve chess problems, we ask whether machines can possess a form of imagination. On the theoretical side, we describe how Bellman's equation may be applied to optimize the probability of winning. To conclude, we discuss the implications of our work on artificial intelligence (AI) and artificial general intelligence (AGI), suggesting possible avenues for future research.

* 15 pages, 8 figures

Via

Access Paper or Ask Questions

Karpov's Queen Sacrifices and AI

Sep 15, 2021

Shiva Maharaj, Nick Polson

Figure 1 for Karpov's Queen Sacrifices and AI

Figure 2 for Karpov's Queen Sacrifices and AI

Figure 3 for Karpov's Queen Sacrifices and AI

Figure 4 for Karpov's Queen Sacrifices and AI

Abstract:Anatoly Karpov's Queen sacrifices are analyzed. Stockfish 14 NNUE -- an AI chess engine -- evaluates how efficient Karpov's sacrifices are. For comparative purposes, we provide a dataset on Karpov's Rook and Knight sacrifices to test whether Karpov achieves a similar level of accuracy. Our study has implications for human-AI interaction and how humans can better understand the strategies employed by black-box AI algorithms. Finally, we conclude with implications for human study in. chess with computer engines.

Via

Access Paper or Ask Questions