Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Alexander Gammerman

Calibrated Large Language Models for Binary Question Answering

Jul 01, 2024

Patrizio Giovannotti, Alexander Gammerman

Abstract:Quantifying the uncertainty of predictions made by large language models (LLMs) in binary text classification tasks remains a challenge. Calibration, in the context of LLMs, refers to the alignment between the model's predicted probabilities and the actual correctness of its predictions. A well-calibrated model should produce probabilities that accurately reflect the likelihood of its predictions being correct. We propose a novel approach that utilizes the inductive Venn--Abers predictor (IVAP) to calibrate the probabilities associated with the output tokens corresponding to the binary labels. Our experiments on the BoolQ dataset using the Llama 2 model demonstrate that IVAP consistently outperforms the commonly used temperature scaling method for various label token choices, achieving well-calibrated probabilities while maintaining high predictive quality. Our findings contribute to the understanding of calibration techniques for LLMs and provide a practical solution for obtaining reliable uncertainty estimates in binary question answering tasks, enhancing the interpretability and trustworthiness of LLM predictions.

* Accepted to COPA 2024 (13th Symposium on Conformal and Probabilistic Prediction with Applications)

Via

Access Paper or Ask Questions

Inductive Conformal Martingales for Change-Point Detection

Jun 11, 2017

Denis Volkhonskiy, Ilia Nouretdinov, Alexander Gammerman, Vladimir Vovk, Evgeny Burnaev

Figure 1 for Inductive Conformal Martingales for Change-Point Detection

Figure 2 for Inductive Conformal Martingales for Change-Point Detection

Figure 3 for Inductive Conformal Martingales for Change-Point Detection

Figure 4 for Inductive Conformal Martingales for Change-Point Detection

Abstract:We consider the problem of quickest change-point detection in data streams. Classical change-point detection procedures, such as CUSUM, Shiryaev-Roberts and Posterior Probability statistics, are optimal only if the change-point model is known, which is an unrealistic assumption in typical applied problems. Instead we propose a new method for change-point detection based on Inductive Conformal Martingales, which requires only the independence and identical distribution of observations. We compare the proposed approach to standard methods, as well as to change-point detection oracles, which model a typical practical situation when we have only imprecise (albeit parametric) information about pre- and post-change data distributions. Results of comparison provide evidence that change-point detection based on Inductive Conformal Martingales is an efficient tool, capable to work under quite general conditions unlike traditional approaches.

* 22 pages, 9 figures, 5 tables

Via

Access Paper or Ask Questions

Conformal Predictors for Compound Activity Prediction

Mar 14, 2016

Paolo Toccacheli, Ilia Nouretdinov, Alexander Gammerman

Figure 1 for Conformal Predictors for Compound Activity Prediction

Figure 2 for Conformal Predictors for Compound Activity Prediction

Figure 3 for Conformal Predictors for Compound Activity Prediction

Figure 4 for Conformal Predictors for Compound Activity Prediction

Abstract:The paper presents an application of Conformal Predictors to a chemoinformatics problem of identifying activities of chemical compounds. The paper addresses some specific challenges of this domain: a large number of compounds (training examples), high-dimensionality of feature space, sparseness and a strong class imbalance. A variant of conformal predictors called Inductive Mondrian Conformal Predictor is applied to deal with these challenges. Results are presented for several non-conformity measures (NCM) extracted from underlying algorithms and different kernels. A number of performance measures are used in order to demonstrate the flexibility of Inductive Mondrian Conformal Predictors in dealing with such a complex set of data. Keywords: Conformal Prediction, Confidence Estimation, Chemoinformatics, Non-Conformity Measure.

* 17 pages, 5 figures

Via

Access Paper or Ask Questions

Hedging predictions in machine learning

Nov 02, 2006

Alexander Gammerman, Vladimir Vovk

Figure 1 for Hedging predictions in machine learning

Figure 2 for Hedging predictions in machine learning

Figure 3 for Hedging predictions in machine learning

Figure 4 for Hedging predictions in machine learning

Abstract:Recent advances in machine learning make it possible to design efficient prediction algorithms for data sets with huge numbers of parameters. This paper describes a new technique for "hedging" the predictions output by many such algorithms, including support vector machines, kernel ridge regression, kernel nearest neighbours, and by many other state-of-the-art methods. The hedged predictions for the labels of new objects include quantitative measures of their own accuracy and reliability. These measures are provably valid under the assumption of randomness, traditional in machine learning: the objects and their labels are assumed to be generated independently from the same probability distribution. In particular, it becomes possible to control (up to statistical fluctuations) the number of erroneous predictions by selecting a suitable confidence level. Validity being achieved automatically, the remaining goal of hedged prediction is efficiency: taking full account of the new objects' features and other available information to produce as accurate predictions as possible. This can be done successfully using the powerful machinery of modern machine learning.

* Computer Journal, 50:151-177, 2007
* 24 pages; 9 figures; 2 tables; a version of this paper (with discussion and rejoinder) is to appear in "The Computer Journal"

Via

Access Paper or Ask Questions