Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Jonathan E. Fieldsend

PRoA: A Probabilistic Robustness Assessment against Functional Perturbations

Jul 05, 2022

Tianle Zhang, Wenjie Ruan, Jonathan E. Fieldsend

Figure 1 for PRoA: A Probabilistic Robustness Assessment against Functional Perturbations

Figure 2 for PRoA: A Probabilistic Robustness Assessment against Functional Perturbations

Figure 3 for PRoA: A Probabilistic Robustness Assessment against Functional Perturbations

Figure 4 for PRoA: A Probabilistic Robustness Assessment against Functional Perturbations

Abstract:In safety-critical deep learning applications robustness measurement is a vital pre-deployment phase. However, existing robustness verification methods are not sufficiently practical for deploying machine learning systems in the real world. On the one hand, these methods attempt to claim that no perturbations can ``fool'' deep neural networks (DNNs), which may be too stringent in practice. On the other hand, existing works rigorously consider $L_p$ bounded additive perturbations on the pixel space, although perturbations, such as colour shifting and geometric transformations, are more practically and frequently occurring in the real world. Thus, from the practical standpoint, we present a novel and general {\it probabilistic robustness assessment method} (PRoA) based on the adaptive concentration, and it can measure the robustness of deep learning models against functional perturbations. PRoA can provide statistical guarantees on the probabilistic robustness of a model, \textit{i.e.}, the probability of failure encountered by the trained model after deployment. Our experiments demonstrate the effectiveness and flexibility of PRoA in terms of evaluating the probabilistic robustness against a broad range of functional perturbations, and PRoA can scale well to various large-scale deep neural networks compared to existing state-of-the-art baselines. For the purpose of reproducibility, we release our tool on GitHub: \url{ https://github.com/TrustAI/PRoA}.

* The short version of this work will appear in the Proceedings of the 2022 European Conference on Machine Learning and Data Mining (ECML-PKDD 2022)

Via

Access Paper or Ask Questions

Variational Autoencoders Without the Variation

Mar 01, 2022

Gregory A. Daly, Jonathan E. Fieldsend, Gavin Tabor

Figure 1 for Variational Autoencoders Without the Variation

Figure 2 for Variational Autoencoders Without the Variation

Figure 3 for Variational Autoencoders Without the Variation

Figure 4 for Variational Autoencoders Without the Variation

Abstract:Variational autoencdoers (VAE) are a popular approach to generative modelling. However, exploiting the capabilities of VAEs in practice can be difficult. Recent work on regularised and entropic autoencoders have begun to explore the potential, for generative modelling, of removing the variational approach and returning to the classic deterministic autoencoder (DAE) with additional novel regularisation methods. In this paper we empirically explore the capability of DAEs for image generation without additional novel methods and the effect of the implicit regularisation and smoothness of large networks. We find that DAEs can be used successfully for image generation without additional loss terms, and that many of the useful properties of VAEs can arise implicitly from sufficiently large convolutional encoders and decoders when trained on CIFAR-10 and CelebA.

* 11 pages, 7 figures, 3 tables

Via

Access Paper or Ask Questions

Asynchronous ε-Greedy Bayesian Optimisation

Oct 16, 2020

George De Ath, Richard M. Everson, Jonathan E. Fieldsend

Figure 1 for Asynchronous ε-Greedy Bayesian Optimisation

Figure 2 for Asynchronous ε-Greedy Bayesian Optimisation

Figure 3 for Asynchronous ε-Greedy Bayesian Optimisation

Figure 4 for Asynchronous ε-Greedy Bayesian Optimisation

Abstract:Bayesian Optimisation (BO) is a popular surrogate model-based approach for optimising expensive black-box functions. In order to reduce optimisation wallclock time, parallel evaluation of the black-box function has been proposed. Asynchronous BO allows for a new evaluation to be started as soon as another finishes, thus maximising utilisation of evaluation workers. We present AEGiS (Asynchronous $\epsilon$-Greedy Global Search), an asynchronous BO method that, with probability $2\epsilon$, performs either Thompson sampling or random selection from the approximate Pareto set trading-off between exploitation (surrogate mean prediction) and exploration (surrogate posterior variance). The remaining $1-2\epsilon$ of moves exploit the surrogate's mean prediction. Results on fifteen synthetic benchmark problems, three meta-surrogate hyperparameter tuning problems and two robot pushing problems show that AEGiS generally outperforms existing methods for asynchronous BO. When a single worker is available performance is no worse than BO using expected improvement. We also verify the importance of each of the three components in an ablation study, as well as comparing Pareto set selection to selection from the entire feasible problem domain, finding that the former is vastly superior.

* Submitted to International Conference on Artificial Intelligence and Statistics (AISTATS 2021). 10 pages (main paper) + 17 pages (supplementary material)

Via

Access Paper or Ask Questions

What do you Mean? The Role of the Mean Function in Bayesian Optimisation

May 08, 2020

George De Ath, Jonathan E. Fieldsend, Richard M. Everson

Figure 1 for What do you Mean? The Role of the Mean Function in Bayesian Optimisation

Figure 2 for What do you Mean? The Role of the Mean Function in Bayesian Optimisation

Figure 3 for What do you Mean? The Role of the Mean Function in Bayesian Optimisation

Figure 4 for What do you Mean? The Role of the Mean Function in Bayesian Optimisation

Abstract:Bayesian optimisation is a popular approach for optimising expensive black-box functions. The next location to be evaluated is selected via maximising an acquisition function that balances exploitation and exploration. Gaussian processes, the surrogate models of choice in Bayesian optimisation, are often used with a constant prior mean function equal to the arithmetic mean of the observed function values. We show that the rate of convergence can depend sensitively on the choice of mean function. We empirically investigate 8 mean functions (constant functions equal to the arithmetic mean, minimum, median and maximum of the observed function evaluations, linear, quadratic polynomials, random forests and RBF networks), using 10 synthetic test problems and two real-world problems, and using the Expected Improvement and Upper Confidence Bound acquisition functions. We find that for design dimensions $\ge5$ using a constant mean function equal to the worst observed quality value is consistently the best choice on the synthetic problems considered. We argue that this worst-observed-quality function promotes exploitation leading to more rapid convergence. However, for the real-world tasks the more complex mean functions capable of modelling the fitness landscape may be effective, although there is no clearly optimum choice.

* Genetic and Evolutionary Computation Conference Companion 2020 (GECCO '20 Companion). 9 pages (main paper) + 4 pages (supplementary material). Code avaliable at http://github.com/georgedeath/bomean

Via

Access Paper or Ask Questions

$ε$-shotgun: $ε$-greedy Batch Bayesian Optimisation

Feb 05, 2020

George De Ath, Richard M. Everson, Jonathan E. Fieldsend, Alma A. M. Rahat

Figure 1 for $ε$-shotgun: $ε$-greedy Batch Bayesian Optimisation

Figure 2 for $ε$-shotgun: $ε$-greedy Batch Bayesian Optimisation

Figure 3 for $ε$-shotgun: $ε$-greedy Batch Bayesian Optimisation

Figure 4 for $ε$-shotgun: $ε$-greedy Batch Bayesian Optimisation

Abstract:Bayesian optimisation is a popular, surrogate model-based approach for optimising expensive black-box functions. Given a surrogate model, the next location to expensively evaluate is chosen via maximisation of a cheap-to-query acquisition function. We present an $\epsilon$-greedy procedure for Bayesian optimisation in batch settings in which the black-box function can be evaluated multiple times in parallel. Our $\epsilon$-shotgun algorithm leverages the model's prediction, uncertainty, and the approximated rate of change of the landscape to determine the spread of batch solutions to be distributed around a putative location. The initial target location is selected either in an exploitative fashion on the mean prediction, or -- with probability $\epsilon$ -- from elsewhere in the design space. This results in locations that are more densely sampled in regions where the function is changing rapidly and in locations predicted to be good (i.e close to predicted optima), with more scattered samples in regions where the function is flatter and/or of poorer quality. We empirically evaluate the $\epsilon$-shotgun methods on a range of synthetic functions and two real-world problems, finding that they perform at least as well as state-of-the-art batch methods and in many cases exceed their performance.

* Submitted to Genetic and Evolutionary Computation Conference 2020 (GECCO '20). 9 pages (main paper) + 10 pages (supplementary material)

Via

Access Paper or Ask Questions

Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

Nov 28, 2019

George De Ath, Richard M. Everson, Alma A. M. Rahat, Jonathan E. Fieldsend

Figure 1 for Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

Figure 2 for Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

Figure 3 for Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

Figure 4 for Greed is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation

Abstract:The performance of acquisition functions for Bayesian optimisation is investigated in terms of the Pareto front between exploration and exploitation. We show that Expected Improvement and the Upper Confidence Bound always select solutions to be expensively evaluated on the Pareto front, but Probability of Improvement is never guaranteed to do so and Weighted Expected Improvement does only for a restricted range of weights. We introduce two novel $\epsilon$-greedy acquisition functions. Extensive empirical evaluation of these together with random search, purely exploratory and purely exploitative search on 10 benchmark problems in 1 to 10 dimensions shows that $\epsilon$-greedy algorithms are generally at least as effective as conventional acquisition functions, particularly with a limited budget. In higher dimensions $\epsilon$-greedy approaches are shown to have improved performance over conventional approaches. These results are borne out on a real world computational fluid dynamics optimisation problem and a robotics active learning problem.

* Submitted to ACM Transactions on Evolutionary Learning and Optimization (TELO). 19 pages (main paper) + 18 pages (supplementary material)

Via

Access Paper or Ask Questions

A Bayesian Approach for the Robust Optimisation of Expensive-To-Evaluate Functions

May 09, 2019

Nicholas D. Sanders, Richard M. Everson, Jonathan E. Fieldsend, Alma A. M. Rahat

Figure 1 for A Bayesian Approach for the Robust Optimisation of Expensive-To-Evaluate Functions

Figure 2 for A Bayesian Approach for the Robust Optimisation of Expensive-To-Evaluate Functions

Figure 3 for A Bayesian Approach for the Robust Optimisation of Expensive-To-Evaluate Functions

Figure 4 for A Bayesian Approach for the Robust Optimisation of Expensive-To-Evaluate Functions

Abstract:Many expensive black-box optimisation problems are sensitive to their inputs. In these problems it makes more sense to locate a region of good designs, than a single, possible fragile, optimal design. Expensive black-box functions can be optimised effectively with Bayesian optimisation, where a Gaussian process is a popular choice as a prior over the expensive function. We propose a method for robust optimisation using Bayesian optimisation to find a region of design space in which the expensive function's performance is insensitive to the inputs whilst retaining a good quality. This is achieved by sampling realisations from a Gaussian process modelling the expensive function and evaluating the improvement for each realisation. The expectation of these improvements can be optimised cheaply with an evolutionary algorithm to determine the next location at which to evaluate the expensive function. We describe an efficient process to locate the optimum expected improvement. We show empirically that evaluating the expensive function at the location in the candidate sweet spot about which the model is most uncertain or at random yield the best convergence in contrast to exploitative schemes. We illustrate our method on six test functions in two, five, and ten dimensions, and demonstrate that it is able to outperform a state-of-the-art approach from the literature.

* Submitted to IEEE Transactions on Evolutionary Computation. 11 pages, 8 figures

Via

Access Paper or Ask Questions

Comparison of the Bayesian and Randomised Decision Tree Ensembles within an Uncertainty Envelope Technique

Apr 14, 2005

Vitaly Schetinin, Jonathan E. Fieldsend, Derek Partridge, Wojtek J. Krzanowski, Richard M. Everson, Trevor C. Bailey, Adolfo Hernandez

Figure 1 for Comparison of the Bayesian and Randomised Decision Tree Ensembles within an Uncertainty Envelope Technique

Figure 2 for Comparison of the Bayesian and Randomised Decision Tree Ensembles within an Uncertainty Envelope Technique

Figure 3 for Comparison of the Bayesian and Randomised Decision Tree Ensembles within an Uncertainty Envelope Technique

Figure 4 for Comparison of the Bayesian and Randomised Decision Tree Ensembles within an Uncertainty Envelope Technique

Abstract:Multiple Classifier Systems (MCSs) allow evaluation of the uncertainty of classification outcomes that is of crucial importance for safety critical applications. The uncertainty of classification is determined by a trade-off between the amount of data available for training, the classifier diversity and the required performance. The interpretability of MCSs can also give useful information for experts responsible for making reliable classifications. For this reason Decision Trees (DTs) seem to be attractive classification models for experts. The required diversity of MCSs exploiting such classification models can be achieved by using two techniques, the Bayesian model averaging and the randomised DT ensemble. Both techniques have revealed promising results when applied to real-world problems. In this paper we experimentally compare the classification uncertainty of the Bayesian model averaging with a restarting strategy and the randomised DT ensemble on a synthetic dataset and some domain problems commonly used in the machine learning community. To make the Bayesian DT averaging feasible, we use a Markov Chain Monte Carlo technique. The classification uncertainty is evaluated within an Uncertainty Envelope technique dealing with the class posterior distribution and a given confidence probability. Exploring a full posterior distribution, this technique produces realistic estimates which can be easily interpreted in statistical terms. In our experiments we found out that the Bayesian DTs are superior to the randomised DT ensembles within the Uncertainty Envelope technique.

* Journal of Mathematical Modelling and Algorithms, 2005

Via

Access Paper or Ask Questions

Estimating Classification Uncertainty of Bayesian Decision Tree Technique on Financial Data

Apr 14, 2005

Vitaly Schetinin, Jonathan E. Fieldsend, Derek Partridge, Wojtek J. Krzanowski, Richard M. Everson, Trevor C. Bailey, Adolfo Hernandez

Figure 1 for Estimating Classification Uncertainty of Bayesian Decision Tree Technique on Financial Data

Figure 2 for Estimating Classification Uncertainty of Bayesian Decision Tree Technique on Financial Data

Figure 3 for Estimating Classification Uncertainty of Bayesian Decision Tree Technique on Financial Data

Figure 4 for Estimating Classification Uncertainty of Bayesian Decision Tree Technique on Financial Data

Abstract:Bayesian averaging over classification models allows the uncertainty of classification outcomes to be evaluated, which is of crucial importance for making reliable decisions in applications such as financial in which risks have to be estimated. The uncertainty of classification is determined by a trade-off between the amount of data available for training, the diversity of a classifier ensemble and the required performance. The interpretability of classification models can also give useful information for experts responsible for making reliable classifications. For this reason Decision Trees (DTs) seem to be attractive classification models. The required diversity of the DT ensemble can be achieved by using the Bayesian model averaging all possible DTs. In practice, the Bayesian approach can be implemented on the base of a Markov Chain Monte Carlo (MCMC) technique of random sampling from the posterior distribution. For sampling large DTs, the MCMC method is extended by Reversible Jump technique which allows inducing DTs under given priors. For the case when the prior information on the DT size is unavailable, the sweeping technique defining the prior implicitly reveals a better performance. Within this Chapter we explore the classification uncertainty of the Bayesian MCMC techniques on some datasets from the StatLog Repository and real financial data. The classification uncertainty is compared within an Uncertainty Envelope technique dealing with the class posterior distribution and a given confidence probability. This technique provides realistic estimates of the classification uncertainty which can be easily interpreted in statistical terms with the aim of risk evaluation.

Via

Access Paper or Ask Questions