Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Roman Moscoviz

UCL, UCL-CS, Inria, Inria-London, MODAL

How good is PAC-Bayes at explaining generalisation?

Mar 11, 2025

Antoine Picard-Weibel, Eugenio Clerico, Roman Moscoviz, Benjamin Guedj

Figure 1 for How good is PAC-Bayes at explaining generalisation?

Figure 2 for How good is PAC-Bayes at explaining generalisation?

Figure 3 for How good is PAC-Bayes at explaining generalisation?

Figure 4 for How good is PAC-Bayes at explaining generalisation?

Abstract:We discuss necessary conditions for a PAC-Bayes bound to provide a meaningful generalisation guarantee. Our analysis reveals that the optimal generalisation guarantee depends solely on the distribution of the risk induced by the prior distribution. In particular, achieving a target generalisation level is only achievable if the prior places sufficient mass on high-performing predictors. We relate these requirements to the prevalent practice of using data-dependent priors in deep learning PAC-Bayes applications, and discuss the implications for the claim that PAC-Bayes ``explains'' generalisation.

Via

Access Paper or Ask Questions

Learning via Surrogate PAC-Bayes

Oct 14, 2024

Antoine Picard-Weibel, Roman Moscoviz, Benjamin Guedj

Figure 1 for Learning via Surrogate PAC-Bayes

Figure 2 for Learning via Surrogate PAC-Bayes

Figure 3 for Learning via Surrogate PAC-Bayes

Figure 4 for Learning via Surrogate PAC-Bayes

Abstract:PAC-Bayes learning is a comprehensive setting for (i) studying the generalisation ability of learning algorithms and (ii) deriving new learning algorithms by optimising a generalisation bound. However, optimising generalisation bounds might not always be viable for tractable or computational reasons, or both. For example, iteratively querying the empirical risk might prove computationally expensive. In response, we introduce a novel principled strategy for building an iterative learning algorithm via the optimisation of a sequence of surrogate training objectives, inherited from PAC-Bayes generalisation bounds. The key argument is to replace the empirical risk (seen as a function of hypotheses) in the generalisation bound by its projection onto a constructible low dimensional functional space: these projections can be queried much more efficiently than the initial risk. On top of providing that generic recipe for learning via surrogate PAC-Bayes bounds, we (i) contribute theoretical results establishing that iteratively optimising our surrogates implies the optimisation of the original generalisation bounds, (ii) instantiate this strategy to the framework of meta-learning, introducing a meta-objective offering a closed form expression for meta-gradient, (iii) illustrate our approach with numerical experiments inspired by an industrial biochemical problem.

* Neurips 2024, Dec 2024, Vancouver, Canada

Via

Access Paper or Ask Questions