Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Lars Carlsson

High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling

Apr 23, 2021

Hannes Eriksson, Christos Dimitrakakis, Lars Carlsson

Figure 1 for High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling

Figure 2 for High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling

Figure 3 for High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling

Figure 4 for High-dimensional near-optimal experiment design for drug discovery via Bayesian sparse sampling

Abstract:We study the problem of performing automated experiment design for drug screening through Bayesian inference and optimisation. In particular, we compare and contrast the behaviour of linear-Gaussian models and Gaussian processes, when used in conjunction with upper confidence bound algorithms, Thompson sampling, or bounded horizon tree search. We show that non-myopic sophisticated exploration techniques using sparse tree search have a distinct advantage over methods such as Thompson sampling or upper confidence bounds in this setting. We demonstrate the significant superiority of the approach over existing and synthetic datasets of drug toxicity.

* 14 pages, 6 figures

Via

Access Paper or Ask Questions

Retrain or not retrain: Conformal test martingales for change-point detection

Feb 20, 2021

Vladimir Vovk, Ivan Petej, Ilia Nouretdinov, Ernst Ahlberg, Lars Carlsson, Alex Gammerman

Figure 1 for Retrain or not retrain: Conformal test martingales for change-point detection

Figure 2 for Retrain or not retrain: Conformal test martingales for change-point detection

Figure 3 for Retrain or not retrain: Conformal test martingales for change-point detection

Figure 4 for Retrain or not retrain: Conformal test martingales for change-point detection

Abstract:We argue for supplementing the process of training a prediction algorithm by setting up a scheme for detecting the moment when the distribution of the data changes and the algorithm needs to be retrained. Our proposed schemes are based on exchangeability martingales, i.e., processes that are martingales under any exchangeable distribution for the data. Our method, based on conformal prediction, is general and can be applied on top of any modern prediction algorithm. Its validity is guaranteed, and in this paper we make first steps in exploring its efficiency.

* 22 pages, 19 figures, 3 tables

Via

Access Paper or Ask Questions

Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets

Aug 15, 2019

Ola Spjuth, Robin Carrión Brännström, Lars Carlsson, Niharika Gauraha

Figure 1 for Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets

Figure 2 for Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets

Figure 3 for Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets

Figure 4 for Combining Prediction Intervals on Multi-Source Non-Disclosed Regression Datasets

Abstract:Conformal Prediction is a framework that produces prediction intervals based on the output from a machine learning algorithm. In this paper we explore the case when training data is made up of multiple parts available in different sources that cannot be pooled. We here consider the regression case and propose a method where a conformal predictor is trained on each data source independently, and where the prediction intervals are then combined into a single interval. We call the approach Non-Disclosed Conformal Prediction (NDCP), and we evaluate it on a regression dataset from the UCI machine learning repository using support vector regression as the underlying machine learning algorithm, with varying number of data sources and sizes. The results show that the proposed method produces conservatively valid prediction intervals, and while we cannot retain the same efficiency as when all data is used, efficiency is improved through the proposed approach as compared to predicting using a single arbitrarily chosen source.

* Accepted to 8th Symposium on Conformal and Probabilistic Prediction with Applications, Golden Sands, Bulgaria, 2019

Via

Access Paper or Ask Questions

Aggregating Predictions on Multiple Non-disclosed Datasets using Conformal Prediction

Jun 14, 2018

Ola Spjuth, Lars Carlsson, Niharika Gauraha

Figure 1 for Aggregating Predictions on Multiple Non-disclosed Datasets using Conformal Prediction

Figure 2 for Aggregating Predictions on Multiple Non-disclosed Datasets using Conformal Prediction

Figure 3 for Aggregating Predictions on Multiple Non-disclosed Datasets using Conformal Prediction

Figure 4 for Aggregating Predictions on Multiple Non-disclosed Datasets using Conformal Prediction

Abstract:Conformal Prediction is a machine learning methodology that produces valid prediction regions under mild conditions. In this paper, we explore the application of making predictions over multiple data sources of different sizes without disclosing data between the sources. We propose that each data source applies a transductive conformal predictor independently using the local data, and that the individual predictions are then aggregated to form a combined prediction region. We demonstrate the method on several data sets, and show that the proposed method produces conservatively valid predictions and reduces the variance in the aggregated predictions. We also study the effect that the number of data sources and size of each source has on aggregated predictions, as compared with equally sized sources and pooled data.

Via

Access Paper or Ask Questions

Conformal Prediction in Learning Under Privileged Information Paradigm with Applications in Drug Discovery

Apr 04, 2018

Niharika Gauraha, Lars Carlsson, Ola Spjuth

Figure 1 for Conformal Prediction in Learning Under Privileged Information Paradigm with Applications in Drug Discovery

Figure 2 for Conformal Prediction in Learning Under Privileged Information Paradigm with Applications in Drug Discovery

Figure 3 for Conformal Prediction in Learning Under Privileged Information Paradigm with Applications in Drug Discovery

Figure 4 for Conformal Prediction in Learning Under Privileged Information Paradigm with Applications in Drug Discovery

Abstract:This paper explores conformal prediction in the learning under privileged information (LUPI) paradigm. We use the SVM+ realization of LUPI in an inductive conformal predictor, and apply it to the MNIST benchmark dataset and three datasets in drug discovery. The results show that using privileged information produces valid models and improves efficiency compared to standard SVM, however the improvement varies between the tested datasets and is not substantial in the drug discovery applications. More importantly, using SVM+ in a conformal prediction framework enables valid prediction intervals at specified significance levels.

Via

Access Paper or Ask Questions