Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sergio Calvo-Ordoñez

Observation Noise and Initialization in Wide Neural Networks

Feb 03, 2025

Sergio Calvo-Ordoñez, Jonathan Plenk, Richard Bergna, Alvaro Cartea, Jose Miguel Hernandez-Lobato, Konstantina Palla, Kamil Ciosek

Abstract:Performing gradient descent in a wide neural network is equivalent to computing the posterior mean of a Gaussian Process with the Neural Tangent Kernel (NTK-GP), for a specific choice of prior mean and with zero observation noise. However, existing formulations of this result have two limitations: i) the resultant NTK-GP assumes no noise in the observed target variables, which can result in suboptimal predictions with noisy data; ii) it is unclear how to extend the equivalence to an arbitrary prior mean, a crucial aspect of formulating a well-specified model. To address the first limitation, we introduce a regularizer into the neural network's training objective, formally showing its correspondence to incorporating observation noise into the NTK-GP model. To address the second, we introduce a \textit{shifted network} that enables arbitrary prior mean functions. This approach allows us to perform gradient descent on a single neural network, without expensive ensembling or kernel matrix inversion. Our theoretical insights are validated empirically, with experiments exploring different values of observation noise and network architectures.

* Work under review, 22 pages

Via

Access Paper or Ask Questions

Epistemic Uncertainty and Observation Noise with the Neural Tangent Kernel

Sep 10, 2024

Sergio Calvo-Ordoñez, Konstantina Palla, Kamil Ciosek

Figure 1 for Epistemic Uncertainty and Observation Noise with the Neural Tangent Kernel

Abstract:Recent work has shown that training wide neural networks with gradient descent is formally equivalent to computing the mean of the posterior distribution in a Gaussian Process (GP) with the Neural Tangent Kernel (NTK) as the prior covariance and zero aleatoric noise \parencite{jacot2018neural}. In this paper, we extend this framework in two ways. First, we show how to deal with non-zero aleatoric noise. Second, we derive an estimator for the posterior covariance, giving us a handle on epistemic uncertainty. Our proposed approach integrates seamlessly with standard training pipelines, as it involves training a small number of additional predictors using gradient descent on a mean squared error loss. We demonstrate the proof-of-concept of our method through empirical evaluation on synthetic regression.

* 11 pages including appendix. Fix incorrect author affiliations in the initial revision due to typos

Via

Access Paper or Ask Questions

Uncertainty Modeling in Graph Neural Networks via Stochastic Differential Equations

Sep 01, 2024

Richard Bergna, Sergio Calvo-Ordoñez, Felix L. Opolka, Pietro Liò, Jose Miguel Hernandez-Lobato

Figure 1 for Uncertainty Modeling in Graph Neural Networks via Stochastic Differential Equations

Figure 2 for Uncertainty Modeling in Graph Neural Networks via Stochastic Differential Equations

Figure 3 for Uncertainty Modeling in Graph Neural Networks via Stochastic Differential Equations

Figure 4 for Uncertainty Modeling in Graph Neural Networks via Stochastic Differential Equations

Abstract:We address the problem of learning uncertainty-aware representations for graph-structured data. While Graph Neural Ordinary Differential Equations (GNODE) are effective in learning node representations, they fail to quantify uncertainty. To address this, we introduce Latent Graph Neural Stochastic Differential Equations (LGNSDE), which enhance GNODE by embedding randomness through Brownian motion to quantify uncertainty. We provide theoretical guarantees for LGNSDE and empirically show better performance in uncertainty quantification.

* 9 pages including appendix

Via

Access Paper or Ask Questions