Abstract:This article advocates the use of conformal prediction (CP) methods for Gaussian process (GP) interpolation to enhance the calibration of prediction intervals. We begin by illustrating that using a GP model with parameters selected by maximum likelihood often results in predictions that are not optimally calibrated. CP methods can adjust the prediction intervals, leading to better uncertainty quantification while maintaining the accuracy of the underlying GP model. We compare different CP variants and introduce a novel variant based on an asymmetric score. Our numerical experiments demonstrate the effectiveness of CP methods in improving calibration without compromising accuracy. This work aims to facilitate the adoption of CP methods in the GP community.
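As an illustration of the kind of adjustment conformal prediction performs, the sketch below applies a basic split-conformal correction to the prediction intervals of a scikit-learn GP model: normalized residuals on a held-out calibration set give a scaling factor for the GP's standard deviations. The data, kernel, and nonconformity score are illustrative assumptions; the article compares several CP variants (including one based on an asymmetric score) that are not reproduced here.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

# Toy data: noisy observations of a 1-d function (illustrative only).
rng = np.random.default_rng(0)
x = rng.uniform(0, 1, size=(60, 1))
y = np.sin(6 * x[:, 0]) + 0.1 * rng.standard_normal(60)

# Split into a training set (to fit the GP) and a calibration set (to compute scores).
x_fit, y_fit = x[:40], y[:40]
x_cal, y_cal = x[40:], y[40:]

gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), alpha=1e-6).fit(x_fit, y_fit)

# Normalized absolute residuals on the calibration set.
mu_cal, sd_cal = gp.predict(x_cal, return_std=True)
scores = np.abs(y_cal - mu_cal) / sd_cal

# Finite-sample-corrected quantile of the calibration scores.
alpha = 0.1
n = len(scores)
q = np.quantile(scores, min(1.0, np.ceil((n + 1) * (1 - alpha)) / n))

# Conformalized prediction intervals at new points: mu +/- q * sd.
x_new = np.linspace(0, 1, 5).reshape(-1, 1)
mu, sd = gp.predict(x_new, return_std=True)
lower, upper = mu - q * sd, mu + q * sd
print(np.c_[lower, upper])
```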
Abstract:We consider an unknown multivariate function representing a system, such as a complex numerical simulator, that takes both deterministic and uncertain inputs. Our objective is to estimate the set of deterministic inputs for which the probability (with respect to the distribution of the uncertain inputs) that the output belongs to a given set is controlled by a given threshold. To solve this problem, we propose a Bayesian strategy based on the Stepwise Uncertainty Reduction (SUR) principle to sequentially choose the points at which the function should be evaluated in order to approximate the set of interest. We illustrate its performance and practical interest in several numerical experiments.
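To make the estimated quantity concrete, the sketch below computes, for each deterministic input on a grid, the posterior probability (under a fitted GP) that the exceedance probability over the uncertain input is below a prescribed level. The toy simulator, thresholds, and distributions are assumptions made for the illustration; this is only the posterior quantity a SUR strategy would reduce uncertainty about, not the sequential SUR criterion itself.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

# Hypothetical simulator with a deterministic input x and an uncertain input u.
def f(x, u):
    return np.sin(5 * x) + 0.5 * u

rng = np.random.default_rng(1)
xu_train = rng.uniform(0, 1, size=(30, 2))
y_train = f(xu_train[:, 0], xu_train[:, 1])
gp = GaussianProcessRegressor(kernel=Matern(nu=2.5), alpha=1e-8).fit(xu_train, y_train)

# For each candidate x, estimate P_U(f(x, U) > t) under the GP posterior,
# by Monte Carlo over U combined with joint posterior draws of the GP.
t, p_threshold = 0.5, 0.2
x_grid = np.linspace(0, 1, 21)
u_samples = rng.uniform(0, 1, size=64)   # distribution of the uncertain input
for x in x_grid:
    xu = np.column_stack([np.full_like(u_samples, x), u_samples])
    paths = gp.sample_y(xu, n_samples=200, random_state=2)   # posterior draws
    prob_exceed = (paths > t).mean(axis=0)        # P_U(f(x, U) > t), one value per draw
    coverage = (prob_exceed <= p_threshold).mean()  # posterior prob. that x is in the set
    print(f"x = {x:.2f}  P[x in target set | data] approx {coverage:.2f}")
```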
Abstract:This article focuses on the multi-objective optimization of stochastic simulators with high output variance, where the input space is finite and the objective functions are expensive to evaluate. We rely on Bayesian optimization algorithms, which use probabilistic models to make predictions about the functions to be optimized. The proposed approach extends the Pareto Active Learning (PAL) algorithm for the estimation of Pareto-optimal solutions, making it suitable for the stochastic setting. We name it Pareto Active Learning for Stochastic Simulators (PALS). The performance of PALS is assessed through numerical experiments on a set of two-dimensional, bi-objective test problems, where it outperforms scalarization-based and random-search approaches.
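A core ingredient of PAL-type algorithms is a domination test between per-design confidence boxes built from the posterior means and standard deviations of the objectives. The sketch below shows one such pruning test on made-up posterior summaries; the constants, box construction, and classification rules are illustrative and differ from the actual PALS rules.

```python
import numpy as np

# Hypothetical posterior summaries for two objectives (to be minimized):
# mu[i, j], sd[i, j] = posterior mean / std of objective j at design i.
rng = np.random.default_rng(3)
mu = rng.normal(size=(20, 2))
sd = 0.3 * np.ones((20, 2))

beta = 2.0                                  # width of the confidence boxes
lower, upper = mu - beta * sd, mu + beta * sd

# Discard a design if the pessimistic corner (upper) of some other design
# dominates its optimistic corner (lower); the rest remain "possibly Pareto".
possibly_pareto = []
for i in range(len(mu)):
    dominated = any(np.all(upper[k] <= lower[i]) and np.any(upper[k] < lower[i])
                    for k in range(len(mu)) if k != i)
    if not dominated:
        possibly_pareto.append(i)
print("possibly Pareto-optimal designs:", possibly_pareto)
```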
Abstract:This work presents a new procedure for obtaining predictive distributions in the context of Gaussian process (GP) modeling, with a relaxation of the interpolation constraints outside some ranges of interest: the mean of the predictive distributions no longer necessarily interpolates the observed values when they are outside the ranges of interest, but is simply constrained to remain outside them. This method, called relaxed Gaussian process (reGP) interpolation, provides better predictive distributions in the ranges of interest, especially in cases where a stationarity assumption for the GP model is not appropriate. It can be viewed as a goal-oriented method and becomes particularly interesting in Bayesian optimization, for example for the minimization of an objective function, where good predictive distributions for low function values are important. When the expected improvement criterion and reGP are used for sequentially choosing evaluation points, the convergence of the resulting optimization algorithm is theoretically guaranteed (provided that the function to be optimized lies in the reproducing kernel Hilbert space attached to the known covariance of the underlying Gaussian process). Experiments indicate that using reGP instead of stationary GP models in Bayesian optimization is beneficial.
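For reference, the expected improvement criterion mentioned above has the standard closed form below under a Gaussian predictive distribution (how it is evaluated under the reGP predictive distributions is detailed in the article). A minimal version for minimization, with the candidate means and standard deviations treated as given:

```python
import numpy as np
from scipy.stats import norm

def expected_improvement(mu, sd, f_min):
    """EI for minimization: E[max(f_min - Y, 0)] with Y ~ N(mu, sd^2)."""
    sd = np.maximum(sd, 1e-12)          # guard against zero predictive std
    z = (f_min - mu) / sd
    return (f_min - mu) * norm.cdf(z) + sd * norm.pdf(z)

# Toy usage: two candidate points, current best observed value 0.4.
print(expected_improvement(np.array([0.2, 0.5]), np.array([0.1, 0.3]), f_min=0.4))
```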
Abstract:This article revisits the fundamental problem of parameter selection for Gaussian process interpolation. By choosing the mean and the covariance functions of a Gaussian process within parametric families, the user obtains a family of Bayesian procedures for making predictions about the unknown function, and must choose a member of the family that will hopefully provide good predictive performance. We base our study on the general concept of scoring rules, which provides an effective framework for building leave-one-out selection and validation criteria, and on a notion of extended likelihood criteria, based on an idea proposed by Fasshauer and co-authors in 2009, which makes it possible to recover standard selection criteria such as the generalized cross-validation criterion. In this setting, we show empirically, on several test problems from the literature, that the choice of an appropriate family of models is often more important than the choice of a particular selection criterion (e.g., the likelihood versus a leave-one-out criterion). Moreover, our numerical results show that the regularity parameter of a Matérn covariance can be selected effectively by most selection criteria.
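As an example of a scoring rule that can be averaged over leave-one-out predictive distributions to build a selection criterion, the continuous ranked probability score (CRPS) of a Gaussian predictive distribution has a well-known closed form (see, e.g., Gneiting and Raftery, 2007). The sketch below gives that formula only; it is not tied to the article's specific criteria.

```python
import numpy as np
from scipy.stats import norm

def gaussian_crps(mu, sigma, y):
    """CRPS of the Gaussian predictive distribution N(mu, sigma^2) at observation y.
    Smaller is better; averaging this score over leave-one-out predictions yields
    one possible scoring-rule-based selection criterion."""
    z = (y - mu) / sigma
    return sigma * (z * (2 * norm.cdf(z) - 1) + 2 * norm.pdf(z) - 1 / np.sqrt(np.pi))

# Toy usage: score a predictive distribution N(0, 1) against an observation y = 0.5.
print(gaussian_crps(0.0, 1.0, 0.5))
```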
Abstract:This article focuses on numerical issues in maximum likelihood parameter estimation for Gaussian process regression (GPR). It investigates the origin of these issues and provides simple but effective improvement strategies. Although this work targets a basic problem, a host of studies, particularly in the Bayesian optimization literature, rely on off-the-shelf GPR implementations; for the conclusions of these studies to be reliable and reproducible, robust GPR implementations are critical.
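One typical safeguard of the kind discussed here is to add a small adaptive "nugget" (jitter) to the covariance matrix before its Cholesky factorization when evaluating the likelihood. The sketch below illustrates this single safeguard only, under a simple zero-mean Gaussian model; it is not the article's full set of recommendations.

```python
import numpy as np

def negative_log_likelihood(K, y, jitter=1e-8, max_tries=6):
    """Gaussian negative log-likelihood of observations y under covariance matrix K.
    If the Cholesky factorization fails because K is numerically singular or
    slightly indefinite, a growing jitter is added to the diagonal and the
    factorization is retried."""
    n = len(y)
    for _ in range(max_tries):
        try:
            L = np.linalg.cholesky(K + jitter * np.eye(n))
            break
        except np.linalg.LinAlgError:
            jitter *= 10.0
    else:
        raise np.linalg.LinAlgError("covariance matrix could not be regularized")
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))   # K^{-1} y via the factor
    log_det = 2.0 * np.sum(np.log(np.diag(L)))
    return 0.5 * (y @ alpha + log_det + n * np.log(2.0 * np.pi))
```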
Abstract:This article deals with the sequential design of experiments for (deterministic or stochastic) multi-fidelity numerical simulators, that is, simulators that offer control over the accuracy of the simulation of the physical phenomenon or system under study. Very often, accurate simulations correspond to a high computational effort, whereas coarse simulations can be obtained at a smaller cost. In this setting, simulation results obtained at several levels of fidelity can be combined in order to estimate quantities of interest (the optimal value of the output, the probability that the output exceeds a given threshold...) in an efficient manner. To do so, we propose a new Bayesian sequential strategy called Maximal Rate of Stepwise Uncertainty Reduction (MR-SUR), which selects additional simulations to be performed by maximizing the ratio between the expected reduction of uncertainty and the cost of simulation. This generic strategy unifies several existing methods and provides a principled approach to developing new ones. We assess its performance on several examples, including a computationally intensive problem of fire safety analysis where the quantity of interest is the probability of exceeding a tenability threshold during a building fire.
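The selection rule itself is simple to state: among candidate simulations (an input together with a fidelity level), pick the one maximizing the expected uncertainty reduction divided by the simulation cost. The generic sketch below assumes user-supplied `expected_reduction` and `cost` callables; computing the expected reduction for a given quantity of interest and GP model is where the actual work lies and is not shown.

```python
def select_next_simulation(candidates, expected_reduction, cost):
    """Maximize the ratio (expected uncertainty reduction) / (simulation cost)."""
    return max(candidates, key=lambda c: expected_reduction(c) / cost(c))

# Toy usage with made-up numbers: candidates are (input, fidelity-level) pairs.
candidates = [((0.2,), 1), ((0.2,), 2), ((0.7,), 1)]
reductions = {((0.2,), 1): 0.05, ((0.2,), 2): 0.12, ((0.7,), 1): 0.03}
costs = {((0.2,), 1): 1.0, ((0.2,), 2): 10.0, ((0.7,), 1): 1.0}
best = select_next_simulation(candidates, reductions.get, costs.get)
print(best)   # ((0.2,), 1): the best reduction-to-cost ratio
```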
Abstract:We consider the problem of estimating the parameters of the covariance function of a Gaussian process by cross-validation. We suggest using new cross-validation criteria derived from the literature on scoring rules. We also provide an efficient method for computing the gradient of a cross-validation criterion. To the best of our knowledge, our method is more efficient than those proposed in the literature so far, as it lowers the complexity of jointly evaluating leave-one-out criteria and their gradients.
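For context, leave-one-out predictions for a zero-mean GP can be obtained from a single factorization of the covariance matrix via classical identities (Dubrule, 1983), as sketched below. The article's contribution concerns in particular the efficient computation of the gradients of such criteria, which this sketch does not cover.

```python
import numpy as np

def loo_predictions(K, y):
    """Leave-one-out predictive means and variances of a zero-mean GP with
    covariance matrix K, using the identities
        mu_i       = y_i - [K^{-1} y]_i / [K^{-1}]_{ii},
        sigma_i^2  = 1 / [K^{-1}]_{ii},
    so that all n leave-one-out predictions come from a single inverse of K."""
    K_inv = np.linalg.inv(K)
    d = np.diag(K_inv)
    mu = y - (K_inv @ y) / d
    var = 1.0 / d
    return mu, var

def loo_log_score(K, y):
    """Average negative log predictive density over the leave-one-out folds."""
    mu, var = loo_predictions(K, y)
    return np.mean(0.5 * (np.log(2 * np.pi * var) + (y - mu) ** 2 / var))
```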
Abstract:In this article, we consider a stochastic numerical simulator to assess the impact of some factors on a phenomenon. The simulator is seen as a black box with inputs and outputs. The quality of a simulation, hereafter referred to as fidelity, is assumed to be tunable by means of an additional input of the simulator (e.g., a mesh size parameter): high-fidelity simulations provide more accurate results, but are time-consuming. Using a limited computation-time budget, we want to estimate, for any value of the physical inputs, the probability that a certain scalar output of the simulator will exceed a given critical threshold at the highest fidelity level. The problem is addressed in a Bayesian framework, using a Gaussian process model of the multi-fidelity simulator. We consider a Bayesian estimator of the probability, together with an associated measure of uncertainty, and propose a new multi-fidelity sequential design strategy, called Maximum Speed of Uncertainty Reduction (MSUR), to select the value of physical inputs and the fidelity level of new simulations. The MSUR strategy is tested on an example.
Abstract:This article addresses the problem of derivative-free (single- or multi-objective) optimization subject to multiple inequality constraints. Both the objective and constraint functions are assumed to be smooth, non-linear and expensive to evaluate. As a consequence, the number of evaluations that can be used to carry out the optimization is very limited, as in complex industrial design optimization problems. The method we propose to overcome this difficulty has its roots in both the Bayesian and the multi-objective optimization literatures. More specifically, an extended domination rule is used to handle objectives and constraints in a unified way, and a corresponding expected hyper-volume improvement sampling criterion is proposed. This new criterion is naturally adapted to the search for a feasible point when none is available, and reduces to existing Bayesian sampling criteria (the classical Expected Improvement (EI) criterion and some of its constrained or multi-objective extensions) as soon as at least one feasible point is available. The calculation and optimization of the criterion are performed using Sequential Monte Carlo techniques. In particular, an algorithm similar to the subset simulation method, which is well known in the field of structural reliability, is used to estimate the criterion. The method, which we call BMOO (for Bayesian Multi-Objective Optimization), is compared to state-of-the-art algorithms for single- and multi-objective constrained optimization.
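To convey the flavor of handling objectives and constraints through a single domination relation, the sketch below uses a simple feasibility-first rule: infeasible points are compared by their constraint violations, feasible points by their objectives, and feasible points dominate infeasible ones. The article's extended domination rule is defined differently; this is only an illustrative stand-in, with made-up inputs.

```python
import numpy as np

def dominates(f1, c1, f2, c2):
    """Return True if point 1 dominates point 2 under a feasibility-first rule.
    f1, f2: objective vectors (to be minimized); c1, c2: constraint vectors,
    with the convention c <= 0 meaning feasible."""
    v1, v2 = np.maximum(c1, 0.0), np.maximum(c2, 0.0)   # constraint violations
    feas1, feas2 = not np.any(v1 > 0), not np.any(v2 > 0)
    if feas1 and not feas2:
        return True
    if feas2 and not feas1:
        return False
    a, b = (f1, f2) if feas1 else (v1, v2)   # compare objectives or violations
    return bool(np.all(a <= b) and np.any(a < b))

# Toy check: a feasible point dominates an infeasible one.
print(dominates(np.array([1.0, 2.0]), np.array([-0.1]),
                np.array([0.5, 1.0]), np.array([0.3])))   # True
```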