Abstract:Gaussian Process (GP) models are a class of flexible non-parametric models with rich representational power. A Gaussian process with additive structure can model complex responses whilst retaining interpretability. Previous work showed that additive Gaussian process models require high-dimensional interaction terms. We propose the orthogonal additive kernel (OAK), which imposes an orthogonality constraint on the additive functions, enabling an identifiable, low-dimensional representation of the functional relationship. We connect the OAK kernel to the functional ANOVA decomposition and show improved convergence rates for sparse computation methods. With only a small number of additive low-dimensional terms, we demonstrate that the OAK model achieves similar or better predictive performance compared to black-box models, while retaining interpretability.
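To make the additive structure concrete, below is a minimal sketch of a plain first-order additive kernel built as a sum of one-dimensional squared-exponential kernels. This illustrates additive GP structure only, not the orthogonality constraint that defines OAK; all names and settings are illustrative.

```python
import numpy as np

def rbf_1d(x, z, lengthscale=1.0, variance=1.0):
    """Squared-exponential kernel on a single input dimension."""
    d = x[:, None] - z[None, :]
    return variance * np.exp(-0.5 * (d / lengthscale) ** 2)

def additive_kernel(X, Z, lengthscales, variances):
    """First-order additive kernel: k(x, z) = sum_d k_d(x_d, z_d)."""
    K = np.zeros((X.shape[0], Z.shape[0]))
    for d in range(X.shape[1]):
        K += rbf_1d(X[:, d], Z[:, d], lengthscales[d], variances[d])
    return K

X = np.random.rand(5, 3)
K = additive_kernel(X, X, lengthscales=[1.0] * 3, variances=[1.0] * 3)
```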
Abstract:In the univariate setting, using the kernel spectral representation is an appealing approach for generating stationary covariance functions. However, performing the same task for multiple-output Gaussian processes is substantially more challenging. We demonstrate that current approaches to modelling cross-covariances with a spectral mixture kernel possess a critical blind spot. For a given pair of processes, the cross-covariance is not reproducible across the full range of permitted correlations, aside from the special case where their spectral densities are of identical shape. We present a solution to this issue by replacing the conventional Gaussian components of a spectral mixture with block components of finite bandwidth (i.e. rectangular step functions). The proposed family of kernels represents the first multi-output generalisation of the spectral mixture kernel that can approximate any stationary multi-output kernel to arbitrary precision.
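For intuition on the block components, a single rectangular spectral density of bandwidth $\Delta$ centred at frequency $\mu$, $S(\omega) = \frac{1}{\Delta}\,\mathbb{1}\left[|\omega - \mu| \le \frac{\Delta}{2}\right]$, yields (by Bochner's theorem) the covariance
$$k(\tau) = \int S(\omega)\, e^{i\omega\tau}\, d\omega = e^{i\mu\tau}\,\mathrm{sinc}\!\left(\frac{\Delta\tau}{2}\right), \qquad \mathrm{sinc}(x) = \frac{\sin x}{x},$$
and symmetrising over $\pm\mu$ gives the real kernel $\cos(\mu\tau)\,\mathrm{sinc}(\Delta\tau/2)$. This is the standard sinc-kernel identity; the paper's exact parameterisation of the multi-output blocks may differ.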
Abstract:Mixtures of experts probabilistically divide the input space into regions, where the assumptions of each expert, or conditional model, need only hold locally. Combined with Gaussian process (GP) experts, this results in a powerful and highly flexible model. We focus on alternative mixtures of GP experts, which model the joint distribution of the inputs and targets explicitly. We highlight issues of this approach in multi-dimensional input spaces: poor scalability and the need for an unnecessarily large number of experts, which degrade predictive performance and increase uncertainty. We construct a novel model to address these issues through a nested partitioning scheme that automatically infers the number of components at both levels. Multiple response types are accommodated through a generalised GP framework, while multiple input types are included through a factorised exponential family structure. We show the effectiveness of our approach in estimating a parsimonious probabilistic description of both synthetic data of increasing dimension and an Alzheimer's challenge dataset.
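As a toy illustration of the generative (joint input-target) view, the sketch below samples from a generic mixture of GP experts with Gaussian input densities; it is not the nested partitioning scheme of the paper, and all settings are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def rbf(x, z, lengthscale=0.5):
    return np.exp(-0.5 * ((x[:, None] - z[None, :]) / lengthscale) ** 2)

n_experts, n = 3, 60
input_means = rng.normal(0.0, 3.0, size=n_experts)  # per-expert input model
weights = np.ones(n_experts) / n_experts            # mixing weights

z = rng.choice(n_experts, size=n, p=weights)        # expert assignments
X, Y = np.empty(n), np.empty(n)
for k in range(n_experts):
    idx = np.where(z == k)[0]
    X[idx] = rng.normal(input_means[k], 1.0, size=idx.size)   # inputs
    K = rbf(X[idx], X[idx]) + 1e-6 * np.eye(idx.size)
    Y[idx] = rng.multivariate_normal(np.zeros(idx.size), K)   # GP targets
```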
Abstract:We consider the problem of adaptively placing sensors along an interval to detect stochastically-generated events. We present a new formulation of the problem as a continuum-armed bandit problem with feedback in the form of partial observations of realisations of an inhomogeneous Poisson process. We design a solution method by combining Thompson sampling with nonparametric inference via increasingly granular Bayesian histograms and derive an $\tilde{O}(T^{2/3})$ bound on the Bayesian regret in $T$ rounds. This is coupled with the design of an efficient optimisation approach to select actions in polynomial time. In simulations we demonstrate our approach to have substantially lower and less variable regret than competitor algorithms.
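A minimal sketch of the ingredients, assuming a fixed-resolution Bayesian histogram with conjugate Gamma-Poisson updates; the paper uses increasingly granular histograms and a polynomial-time action optimiser, and the intensity, bin count and sensor budget below are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

true_rate = lambda x: 5.0 * np.exp(-10 * (x - 0.7) ** 2)  # unknown intensity on [0, 1]
n_bins, T, n_sensors = 20, 200, 3
edges = np.linspace(0, 1, n_bins + 1)
widths = np.diff(edges)
alpha, beta = np.ones(n_bins), np.ones(n_bins)   # Gamma(1, 1) prior on each bin's rate

for t in range(T):
    sampled = rng.gamma(alpha, 1.0 / beta)        # Thompson sample of per-bin rates
    chosen = np.argsort(sampled)[-n_sensors:]     # place sensors on the top bins
    for b in chosen:
        mid = edges[b] + widths[b] / 2
        y = rng.poisson(true_rate(mid) * widths[b])  # events observed in the covered bin
        alpha[b] += y                             # conjugate Gamma-Poisson update
        beta[b] += widths[b]
```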
Abstract:Online learning has classically focused on the expected behaviour of learning policies. Recently, risk-averse online learning has gained much attention. In this paper, we study a risk-averse multi-armed bandit problem where the performance of policies is measured by the mean-variance of the rewards. The variance of the rewards depends on the variance of the underlying processes as well as the variance of the player's decisions. We analyse the performance of two existing policies and establish new fundamental limitations on risk-averse learning. In particular, we show that although an $\mathcal{O}(\log T)$ distribution-dependent regret in time $T$ is achievable (similar to the risk-neutral setting), the worst-case (i.e. minimax) regret is lower bounded by $\Omega(T)$ (in contrast to the $\Omega(\sqrt{T})$ lower bound in the risk-neutral setting). The lower bound results are even stronger in the sense that they are proven for the case of online learning with full feedback.
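In the mean-variance formulation common in this literature, each arm $i$ with reward mean $\mu_i$ and variance $\sigma_i^2$ is scored as
$$\mathrm{MV}_i = \sigma_i^2 - \rho\,\mu_i,$$
where $\rho > 0$ trades off risk against reward, and regret is measured against the arm with the smallest mean-variance; the paper's exact convention may differ.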
Abstract:GPflow is a Gaussian process library that uses TensorFlow for its core computations and Python for its front end. The distinguishing features of GPflow are that it uses variational inference as the primary approximation method, provides concise code through the use of automatic differentiation, has been engineered with a particular emphasis on software testing and is able to exploit GPU hardware.
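A minimal usage sketch, assuming the GPflow 2.x API; the data and kernel choice are illustrative.

```python
import numpy as np
import gpflow

# Toy 1-D regression data
X = np.random.rand(100, 1)
Y = np.sin(12 * X) + 0.1 * np.random.randn(100, 1)

# Exact GP regression with a squared-exponential kernel
model = gpflow.models.GPR((X, Y), kernel=gpflow.kernels.SquaredExponential())

# Hyperparameters are fitted by gradient-based optimisation, with gradients
# supplied by TensorFlow's automatic differentiation
gpflow.optimizers.Scipy().minimize(model.training_loss, model.trainable_variables)

# Posterior mean and variance at test inputs
mean, var = model.predict_f(np.linspace(0, 1, 50)[:, None])
```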
Abstract:The Dirichlet process mixture (DPM) is a ubiquitous, flexible Bayesian nonparametric statistical model. However, full probabilistic inference in this model is analytically intractable, so computationally intensive techniques such as Gibbs sampling are required. As a result, DPM-based methods, which have considerable potential, are restricted to applications in which computational resources and time for inference are plentiful. For example, they would not be practical for digital signal processing on embedded hardware, where computational resources are at a serious premium. Here, we develop a simplified yet statistically rigorous approximate maximum a-posteriori (MAP) inference algorithm for DPMs. This algorithm is as simple as K-means clustering and performs as well as Gibbs sampling in experiments, while requiring only a fraction of the computational effort. Unlike related small-variance asymptotics, our algorithm is non-degenerate and so inherits the "rich get richer" property of the Dirichlet process. It also retains a non-degenerate closed-form likelihood, which enables standard tools such as cross-validation to be used. This is a well-posed approximation to the MAP solution of the probabilistic DPM model.
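The flavour of the algorithm can be caricatured as a K-means-style sweep with two extra ingredients: a "rich get richer" bonus $-\log N_k$ favouring large clusters, and a penalised option to open a new cluster. The sketch below uses fixed spherical Gaussian components for simplicity; the actual algorithm uses conjugate posterior predictive densities, so this is an illustration rather than the paper's method.

```python
import numpy as np

def map_dp_sweep(X, z, alpha=1.0, sigma2=1.0):
    """One MAP assignment sweep for a DP mixture with fixed spherical Gaussian
    components (a simplified caricature of MAP-DP inference)."""
    for i in range(X.shape[0]):
        z[i] = -1                                    # hold out point i
        labels = sorted(k for k in set(z) if k >= 0)
        costs = []
        for k in labels:
            members = X[z == k]
            dist = np.sum((X[i] - members.mean(0)) ** 2) / (2 * sigma2)
            costs.append(dist - np.log(len(members)))    # rich-get-richer term
        # New-cluster option: distance to the prior mean (zero), penalised by -log(alpha)
        costs.append(np.sum(X[i] ** 2) / (2 * sigma2) - np.log(alpha))
        best = int(np.argmin(costs))
        z[i] = labels[best] if best < len(labels) else max(labels, default=-1) + 1
    return z

X = np.vstack([np.random.randn(30, 2), np.random.randn(30, 2) + 5.0])
z = np.zeros(len(X), dtype=int)
for _ in range(10):                                  # iterate sweeps to convergence
    z = map_dp_sweep(X, z)
```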
Abstract:Direct quantile regression involves estimating a given quantile of a response variable as a function of input variables. We present a new framework for direct quantile regression in which a Gaussian process model is learned by minimising the expected tilted loss function. The integration required in learning is not analytically tractable, so we employ the Expectation Propagation algorithm to speed up learning. We describe how this work relates to other quantile regression methods and apply the method to both synthetic and real data sets. The method is shown to be competitive with state-of-the-art methods whilst leveraging the full Gaussian process probabilistic framework.
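For reference, the standard tilted (pinball) loss for quantile level $\tau \in (0, 1)$ is
$$\rho_\tau(u) = u\,\bigl(\tau - \mathbb{1}[u < 0]\bigr) = \begin{cases} \tau u, & u \ge 0, \\ (\tau - 1)\,u, & u < 0, \end{cases}$$
and minimising the expected loss $\mathbb{E}\left[\rho_\tau\bigl(y - f(x)\bigr)\right]$ over $f(x)$ recovers the $\tau$-th conditional quantile of $y$ given $x$.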