Abstract: Bayesian optimization is a form of sequential design: idealize input-output relationships with a suitably flexible nonlinear regression model; fit to data from an initial experimental campaign; devise and optimize a criterion for selecting the next experimental condition(s) under the fitted model (e.g., via predictive equations) to target outcomes of interest (say minima); repeat after acquiring output under those conditions and updating the fit. In many situations this "inner optimization" over the new-data acquisition criterion is cumbersome because it is non-convex/highly multi-modal, may be non-differentiable, or may otherwise thwart numerical optimizers, especially when inference requires Monte Carlo. In such cases it is not uncommon to replace continuous search with a discrete one over random candidates. Here we propose using candidates based on a Delaunay triangulation of the existing input design. In addition to detailing the construction of these "tricands", via a simple wrapper around a conventional convex hull library, we promote several advantages based on properties of the geometric criterion involved. We then demonstrate empirically how tricands can lead to better Bayesian optimization performance compared to both numerically optimized acquisitions and random candidate-based alternatives on benchmark problems.
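To make the candidate construction concrete, the following is a minimal R sketch, not the authors' implementation: it takes candidates at the centroids of the Delaunay simplices of the current design, using delaunayn() from the geometry package (a wrapper around the Qhull convex hull library); the full "tricands" construction described in the abstract may differ in its details.

```r
## Minimal illustration (not the authors' implementation): take candidates
## at the centroids of the Delaunay simplices of the current design, using
## geometry::delaunayn(), which wraps the Qhull convex hull library.
library(geometry)

tricands_sketch <- function(X) {
  tri <- delaunayn(X)  # each row indexes the d+1 vertices of one simplex
  t(apply(tri, 1, function(idx) colMeans(X[idx, , drop = FALSE])))
}

## toy usage: candidates for a random 2d design of size 20
X <- matrix(runif(40), ncol = 2)
Xcand <- tricands_sketch(X)
```

Because each centroid sits in the interior of a simplex spanned by existing design points, such candidates naturally fall in the gaps of the current design rather than on top of previous runs.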
Abstract: Deep Gaussian processes (DGPs) are increasingly popular as predictive models in machine learning (ML) for their non-stationary flexibility and ability to cope with abrupt regime changes in training data. Here we explore DGPs as surrogates for computer simulation experiments whose response surfaces exhibit similar characteristics. In particular, we transport a DGP's automatic warping of the input space and full uncertainty quantification (UQ), via a novel elliptical slice sampling (ESS) Bayesian posterior inferential scheme, through to active learning (AL) strategies that distribute runs non-uniformly in the input space -- something an ordinary (stationary) GP could not do. Building up the design sequentially in this way allows smaller training sets, both limiting expensive evaluation of the simulator code and mitigating the cubic costs of DGP inference. When training data sizes are kept small through careful acquisition, and with a parsimonious layout of latent layers, the framework can be both effective and computationally tractable. Our methods are illustrated on simulation data and two real computer experiments of varying input dimensionality. We provide an open source implementation in the "deepgp" package on CRAN.
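As a companion sketch of the sequential pipeline summarized above (fit a DGP via ESS, then acquire new runs non-uniformly), here is a minimal R active learning loop using the deepgp package's fit_two_layer(), trim(), and ALC() functions; the toy simulator, design sizes, MCMC settings, and some argument details are illustrative assumptions and may vary across package versions.

```r
## Minimal sketch of a DGP-based active learning loop with 'deepgp'.
## The simulator f(), design sizes, and MCMC settings are illustrative.
library(deepgp)

f <- function(x) sin(4 * pi * x) + 0.05 * rnorm(length(x))  # toy "simulator"

x <- matrix(seq(0, 1, length = 10), ncol = 1)        # small initial design
y <- f(x)
xcand <- matrix(seq(0, 1, length = 100), ncol = 1)   # candidate acquisition grid

for (t in 1:5) {                                     # a few acquisition rounds
  fit <- fit_two_layer(x, y, nmcmc = 2000, cov = "exp2")  # ESS-based posterior
  fit <- trim(fit, burn = 1000, thin = 2)            # discard burn-in, thin chain
  alc <- ALC(fit, x_new = xcand)                     # ALC criterion at candidates
  xstar <- xcand[which.max(alc$value), , drop = FALSE]  # acquire max-ALC site
  x <- rbind(x, xstar)
  y <- c(y, f(xstar))                                # run the simulator, refit
}
```

IMSE() provides an alternative acquisition criterion in the same package (minimized rather than maximized), and keeping both the training set and the MCMC run modest, as the abstract emphasizes, keeps each refit tractable.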