Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jon Cockayne

Randomised Postiterations for Calibrated BayesCG

Apr 05, 2025

Niall Vyas, Disha Hegde, Jon Cockayne

Abstract:The Bayesian conjugate gradient method offers probabilistic solutions to linear systems but suffers from poor calibration, limiting its utility in uncertainty quantification tasks. Recent approaches leveraging postiterations to construct priors have improved computational properties but failed to correct calibration issues. In this work, we propose a novel randomised postiteration strategy that enhances the calibration of the BayesCG posterior while preserving its favourable convergence characteristics. We present theoretical guarantees for the improved calibration, supported by results on the distribution of posterior errors. Numerical experiments demonstrate the efficacy of the method in both synthetic and inverse problem settings, showing enhanced uncertainty quantification and better propagation of uncertainties through computational pipelines.

Via

Access Paper or Ask Questions

Learning to Solve Related Linear Systems

Mar 21, 2025

Disha Hegde, Jon Cockayne

Abstract:Solving multiple parametrised related systems is an essential component of many numerical tasks. Borrowing strength from the solved systems and learning will make this process faster. In this work, we propose a novel probabilistic linear solver over the parameter space. This leverages information from the solved linear systems in a regression setting to provide an efficient posterior mean and covariance. We advocate using this as companion regression model for the preconditioned conjugate gradient method, and discuss the favourable properties of the posterior mean and covariance as the initial guess and preconditioner. We also provide several design choices for this companion solver. Numerical experiments showcase the benefits of using our novel solver in a hyperparameter optimisation problem.

Via

Access Paper or Ask Questions

Calibrated Computation-Aware Gaussian Processes

Oct 11, 2024

Disha Hegde, Mohamed Adil, Jon Cockayne

Figure 1 for Calibrated Computation-Aware Gaussian Processes

Figure 2 for Calibrated Computation-Aware Gaussian Processes

Figure 3 for Calibrated Computation-Aware Gaussian Processes

Figure 4 for Calibrated Computation-Aware Gaussian Processes

Abstract:Gaussian processes are notorious for scaling cubically with the size of the training set, preventing application to very large regression problems. Computation-aware Gaussian processes (CAGPs) tackle this scaling issue by exploiting probabilistic linear solvers to reduce complexity, widening the posterior with additional computational uncertainty due to reduced computation. However, the most commonly used CAGP framework results in (sometimes dramatically) conservative uncertainty quantification, making the posterior unrealistic in practice. In this work, we prove that if the utilised probabilistic linear solver is calibrated, in a rigorous statistical sense, then so too is the induced CAGP. We thus propose a new CAGP framework, CAGP-GS, based on using Gauss-Seidel iterations for the underlying probabilistic linear solver. CAGP-GS performs favourably compared to existing approaches when the test set is low-dimensional and few iterations are performed. We test the calibratedness on a synthetic problem, and compare the performance to existing approaches on a large-scale global temperature regression problem.

Via

Access Paper or Ask Questions

Computation-Aware Kalman Filtering and Smoothing

May 14, 2024

Marvin Pförtner, Jonathan Wenger, Jon Cockayne, Philipp Hennig

Abstract:Kalman filtering and smoothing are the foundational mechanisms for efficient inference in Gauss-Markov models. However, their time and memory complexities scale prohibitively with the size of the state space. This is particularly problematic in spatiotemporal regression problems, where the state dimension scales with the number of spatial observations. Existing approximate frameworks leverage low-rank approximations of the covariance matrix. Since they do not model the error introduced by the computational approximation, their predictive uncertainty estimates can be overly optimistic. In this work, we propose a probabilistic numerical method for inference in high-dimensional Gauss-Markov models which mitigates these scaling issues. Our matrix-free iterative algorithm leverages GPU acceleration and crucially enables a tunable trade-off between computational cost and predictive uncertainty. Finally, we demonstrate the scalability of our method on a large-scale climate dataset.

Via

Access Paper or Ask Questions

Bayesian Numerical Methods for Nonlinear Partial Differential Equations

May 03, 2021

Junyang Wang, Jon Cockayne, Oksana Chkrebtii, T. J. Sullivan, Chris. J. Oates

Figure 1 for Bayesian Numerical Methods for Nonlinear Partial Differential Equations

Figure 2 for Bayesian Numerical Methods for Nonlinear Partial Differential Equations

Figure 3 for Bayesian Numerical Methods for Nonlinear Partial Differential Equations

Figure 4 for Bayesian Numerical Methods for Nonlinear Partial Differential Equations

Abstract:The numerical solution of differential equations can be formulated as an inference problem to which formal statistical approaches can be applied. However, nonlinear partial differential equations (PDEs) pose substantial challenges from an inferential perspective, most notably the absence of explicit conditioning formula. This paper extends earlier work on linear PDEs to a general class of initial value problems specified by nonlinear PDEs, motivated by problems for which evaluations of the right-hand-side, initial conditions, or boundary conditions of the PDE have a high computational cost. The proposed method can be viewed as exact Bayesian inference under an approximate likelihood, which is based on discretisation of the nonlinear differential operator. Proof-of-concept experimental results demonstrate that meaningful probabilistic uncertainty quantification for the unknown solution of the PDE can be performed, while controlling the number of times the right-hand-side, initial and boundary conditions are evaluated. A suitable prior model for the solution of the PDE is identified using novel theoretical analysis of the sample path properties of Mat\'{e}rn processes, which may be of independent interest.

Via

Access Paper or Ask Questions

A Probabilistic Taylor Expansion with Applications in Filtering and Differential Equations

Feb 01, 2021

Toni Karvonen, Jon Cockayne, Filip Tronarp, Simo Särkkä

Figure 1 for A Probabilistic Taylor Expansion with Applications in Filtering and Differential Equations

Figure 2 for A Probabilistic Taylor Expansion with Applications in Filtering and Differential Equations

Figure 3 for A Probabilistic Taylor Expansion with Applications in Filtering and Differential Equations

Figure 4 for A Probabilistic Taylor Expansion with Applications in Filtering and Differential Equations

Abstract:We study a class of Gaussian processes for which the posterior mean, for a particular choice of data, replicates a truncated Taylor expansion of any order. The data consists of derivative evaluations at the expansion point and the prior covariance kernel belongs to the class of Taylor kernels, which can be written in a certain power series form. This permits statistical modelling of the uncertainty in a variety of algorithms that exploit first and second order Taylor expansions. To demonstrate the utility of this Gaussian process model we introduce new probabilistic versions of the classical extended Kalman filter for non-linear state estimation and the Euler method for solving ordinary differential equations.

Via

Access Paper or Ask Questions

Probabilistic Iterative Methods for Linear Systems

Jan 11, 2021

Jon Cockayne, Ilse C. F. Ipsen, Chris J. Oates, Tim W. Reid

Figure 1 for Probabilistic Iterative Methods for Linear Systems

Figure 2 for Probabilistic Iterative Methods for Linear Systems

Figure 3 for Probabilistic Iterative Methods for Linear Systems

Figure 4 for Probabilistic Iterative Methods for Linear Systems

Abstract:This paper presents a probabilistic perspective on iterative methods for approximating the solution $\mathbf{x}_* \in \mathbb{R}^d$ of a nonsingular linear system $\mathbf{A} \mathbf{x}_* = \mathbf{b}$. In the approach a standard iterative method on $\mathbb{R}^d$ is lifted to act on the space of probability distributions $\mathcal{P}(\mathbb{R}^d)$. Classically, an iterative method produces a sequence $\mathbf{x}_m$ of approximations that converge to $\mathbf{x}_*$. The output of the iterative methods proposed in this paper is, instead, a sequence of probability distributions $\mu_m \in \mathcal{P}(\mathbb{R}^d)$. The distributional output both provides a "best guess" for $\mathbf{x}_*$, for example as the mean of $\mu_m$, and also probabilistic uncertainty quantification for the value of $\mathbf{x}_*$ when it has not been exactly determined. Theoretical analysis is provided in the prototypical case of a stationary linear iterative method. In this setting we characterise both the rate of contraction of $\mu_m$ to an atomic measure on $\mathbf{x}_*$ and the nature of the uncertainty quantification being provided. We conclude with an empirical illustration that highlights the insight into solution uncertainty that can be provided by probabilistic iterative methods.

Via

Access Paper or Ask Questions

Probabilistic Gradients for Fast Calibration of Differential Equation Models

Sep 03, 2020

Jon Cockayne, Andrew B. Duncan

Figure 1 for Probabilistic Gradients for Fast Calibration of Differential Equation Models

Figure 2 for Probabilistic Gradients for Fast Calibration of Differential Equation Models

Figure 3 for Probabilistic Gradients for Fast Calibration of Differential Equation Models

Abstract:Calibration of large-scale differential equation models to observational or experimental data is a widespread challenge throughout applied sciences and engineering. A crucial bottleneck in state-of-the art calibration methods is the calculation of local sensitivities, i.e. derivatives of the loss function with respect to the estimated parameters, which often necessitates several numerical solves of the underlying system of partial or ordinary differential equations. In this paper we present a new probabilistic approach to computing local sensitivities. The proposed method has several advantages over classical methods. Firstly, it operates within a constrained computational budget and provides a probabilistic quantification of uncertainty incurred in the sensitivities from this constraint. Secondly, information from previous sensitivity estimates can be recycled in subsequent computations, reducing the overall computational effort for iterative gradient-based calibration methods. The methodology presented is applied to two challenging test problems and compared against classical methods.

Via

Access Paper or Ask Questions

Optimal Thinning of MCMC Output

May 08, 2020

Marina Riabiz, Wilson Chen, Jon Cockayne, Pawel Swietach, Steven A. Niederer, Lester Mackey, Chris. J. Oates

Figure 1 for Optimal Thinning of MCMC Output

Figure 2 for Optimal Thinning of MCMC Output

Figure 3 for Optimal Thinning of MCMC Output

Figure 4 for Optimal Thinning of MCMC Output

Abstract:The use of heuristics to assess the convergence and compress the output of Markov chain Monte Carlo can be sub-optimal in terms of the empirical approximations that are produced. Typically a number of the initial states are attributed to "burn in" and removed, whilst the chain can be "thinned" if compression is also required. In this paper we consider the problem of selecting a subset of states, of fixed cardinality, such that the approximation provided by their empirical distribution is close to optimal. A novel method is proposed, based on greedy minimisation of a kernel Stein discrepancy, that is suitable for problems where heavy compression is required. Theoretical results guarantee consistency of the method and its effectiveness is demonstrated in the challenging context of parameter inference for ordinary differential equations. Software is available in the "Stein Thinning" package in both Python and MATLAB, and example code is included.

Via

Access Paper or Ask Questions

On the Sampling Problem for Kernel Quadrature

Jun 11, 2017

Francois-Xavier Briol, Chris J. Oates, Jon Cockayne, Wilson Ye Chen, Mark Girolami

Figure 1 for On the Sampling Problem for Kernel Quadrature

Figure 2 for On the Sampling Problem for Kernel Quadrature

Figure 3 for On the Sampling Problem for Kernel Quadrature

Figure 4 for On the Sampling Problem for Kernel Quadrature

Abstract:The standard Kernel Quadrature method for numerical integration with random point sets (also called Bayesian Monte Carlo) is known to converge in root mean square error at a rate determined by the ratio $s/d$, where $s$ and $d$ encode the smoothness and dimension of the integrand. However, an empirical investigation reveals that the rate constant $C$ is highly sensitive to the distribution of the random points. In contrast to standard Monte Carlo integration, for which optimal importance sampling is well-understood, the sampling distribution that minimises $C$ for Kernel Quadrature does not admit a closed form. This paper argues that the practical choice of sampling distribution is an important open problem. One solution is considered; a novel automatic approach based on adaptive tempering and sequential Monte Carlo. Empirical results demonstrate a dramatic reduction in integration error of up to 4 orders of magnitude can be achieved with the proposed method.

* Proceedings of the 34th International Conference on Machine Learning, PMLR 70:586-595, 2017
* To appear at Thirty-fourth International Conference on Machine Learning (ICML 2017)

Via

Access Paper or Ask Questions