Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Sébastien Destercke

Labex MS2T

Reducing Aleatoric and Epistemic Uncertainty through Multi-modal Data Acquisition

Jan 30, 2025

Arthur Hoarau, Benjamin Quost, Sébastien Destercke, Willem Waegeman

Figure 1 for Reducing Aleatoric and Epistemic Uncertainty through Multi-modal Data Acquisition

Figure 2 for Reducing Aleatoric and Epistemic Uncertainty through Multi-modal Data Acquisition

Figure 3 for Reducing Aleatoric and Epistemic Uncertainty through Multi-modal Data Acquisition

Figure 4 for Reducing Aleatoric and Epistemic Uncertainty through Multi-modal Data Acquisition

Abstract:To generate accurate and reliable predictions, modern AI systems need to combine data from multiple modalities, such as text, images, audio, spreadsheets, and time series. Multi-modal data introduces new opportunities and challenges for disentangling uncertainty: it is commonly assumed in the machine learning community that epistemic uncertainty can be reduced by collecting more data, while aleatoric uncertainty is irreducible. However, this assumption is challenged in modern AI systems when information is obtained from different modalities. This paper introduces an innovative data acquisition framework where uncertainty disentanglement leads to actionable decisions, allowing sampling in two directions: sample size and data modality. The main hypothesis is that aleatoric uncertainty decreases as the number of modalities increases, while epistemic uncertainty decreases by collecting more observations. We provide proof-of-concept implementations on two multi-modal datasets to showcase our data acquisition framework, which combines ideas from active learning, active feature acquisition and uncertainty quantification.

Via

Access Paper or Ask Questions

Guaranteed confidence-band enclosures for PDE surrogates

Jan 30, 2025

Ander Gray, Vignesh Gopakumar, Sylvain Rousseau, Sébastien Destercke

Figure 1 for Guaranteed confidence-band enclosures for PDE surrogates

Figure 2 for Guaranteed confidence-band enclosures for PDE surrogates

Figure 3 for Guaranteed confidence-band enclosures for PDE surrogates

Figure 4 for Guaranteed confidence-band enclosures for PDE surrogates

Abstract:We propose a method for obtaining statistically guaranteed confidence bands for functional machine learning techniques: surrogate models which map between function spaces, motivated by the need build reliable PDE emulators. The method constructs nested confidence sets on a low-dimensional representation (an SVD) of the surrogate model's prediction error, and then maps these sets to the prediction space using set-propagation techniques. The result are conformal-like coverage guaranteed prediction sets for functional surrogate models. We use zonotopes as basis of the set construction, due to their well studied set-propagation and verification properties. The method is model agnostic and can thus be applied to complex Sci-ML models, including Neural Operators, but also in simpler settings. We also elicit a technique to capture the truncation error of the SVD, ensuring the guarantees of the method.

Via

Access Paper or Ask Questions

Robust Confidence Intervals in Stereo Matching using Possibility Theory

Apr 09, 2024

Roman Malinowski, Emmanuelle Sarrazin, Loïc Dumas, Emmanuel Dubois, Sébastien Destercke

Figure 1 for Robust Confidence Intervals in Stereo Matching using Possibility Theory

Figure 2 for Robust Confidence Intervals in Stereo Matching using Possibility Theory

Figure 3 for Robust Confidence Intervals in Stereo Matching using Possibility Theory

Figure 4 for Robust Confidence Intervals in Stereo Matching using Possibility Theory

Abstract:We propose a method for estimating disparity confidence intervals in stereo matching problems. Confidence intervals provide complementary information to usual confidence measures. To the best of our knowledge, this is the first method creating disparity confidence intervals based on the cost volume. This method relies on possibility distributions to interpret the epistemic uncertainty of the cost volume. Our method has the benefit of having a white-box nature, differing in this respect from current state-of-the-art deep neural networks approaches. The accuracy and size of confidence intervals are validated using the Middlebury stereo datasets as well as a dataset of satellite images. This contribution is freely available on GitHub.

Via

Access Paper or Ask Questions

Skeptical binary inferences in multi-label problems with sets of probabilities

May 02, 2022

Yonatan Carlos Carranza Alarcón, Sébastien Destercke

Figure 1 for Skeptical binary inferences in multi-label problems with sets of probabilities

Figure 2 for Skeptical binary inferences in multi-label problems with sets of probabilities

Figure 3 for Skeptical binary inferences in multi-label problems with sets of probabilities

Figure 4 for Skeptical binary inferences in multi-label problems with sets of probabilities

Abstract:In this paper, we consider the problem of making distributionally robust, skeptical inferences for the multi-label problem, or more generally for Boolean vectors. By distributionally robust, we mean that we consider a set of possible probability distributions, and by skeptical we understand that we consider as valid only those inferences that are true for every distribution within this set. Such inferences will provide partial predictions whenever the considered set is sufficiently big. We study in particular the Hamming loss case, a common loss function in multi-label problems, showing how skeptical inferences can be made in this setting. Our experimental results are organised in three sections; (1) the first one indicates the gain computational obtained from our theoretical results by using synthetical data sets, (2) the second one indicates that our approaches produce relevant cautiousness on those hard-to-predict instances where its precise counterpart fails, and (3) the last one demonstrates experimentally how our approach copes with imperfect information (generated by a downsampling procedure) better than the partial abstention [31] and the rejection rules.

Via

Access Paper or Ask Questions

Multi-label Chaining with Imprecise Probabilities

Jul 19, 2021

Yonatan Carlos Carranza Alarcón, Sébastien Destercke

Figure 1 for Multi-label Chaining with Imprecise Probabilities

Figure 2 for Multi-label Chaining with Imprecise Probabilities

Figure 3 for Multi-label Chaining with Imprecise Probabilities

Figure 4 for Multi-label Chaining with Imprecise Probabilities

Abstract:We present two different strategies to extend the classical multi-label chaining approach to handle imprecise probability estimates. These estimates use convex sets of distributions (or credal sets) in order to describe our uncertainty rather than a precise one. The main reasons one could have for using such estimations are (1) to make cautious predictions (or no decision at all) when a high uncertainty is detected in the chaining and (2) to make better precise predictions by avoiding biases caused in early decisions in the chaining. We adapt both strategies to the case of the naive credal classifier, showing that this adaptations are computationally efficient. Our experimental results on missing labels, which investigate how reliable these predictions are in both approaches, indicate that our approaches produce relevant cautiousness on those hard-to-predict instances where the precise models fail.

Via

Access Paper or Ask Questions

Copula-based conformal prediction for Multi-Target Regression

Jan 28, 2021

Soundouss Messoudi, Sébastien Destercke, Sylvain Rousseau

Figure 1 for Copula-based conformal prediction for Multi-Target Regression

Figure 2 for Copula-based conformal prediction for Multi-Target Regression

Figure 3 for Copula-based conformal prediction for Multi-Target Regression

Figure 4 for Copula-based conformal prediction for Multi-Target Regression

Abstract:There are relatively few works dealing with conformal prediction for multi-task learning issues, and this is particularly true for multi-target regression. This paper focuses on the problem of providing valid (i.e., frequency calibrated) multi-variate predictions. To do so, we propose to use copula functions applied to deep neural networks for inductive conformal prediction. We show that the proposed method ensures efficiency and validity for multi-target regression problems on various data sets.

* 17 pages, 8 figures, under review

Via

Access Paper or Ask Questions

From Shallow to Deep Interactions Between Knowledge Representation, Reasoning and Machine Learning (Kay R. Amel group)

Dec 13, 2019

Zied Bouraoui, Antoine Cornuéjols, Thierry Denœux, Sébastien Destercke, Didier Dubois, Romain Guillaume, João Marques-Silva, Jérôme Mengin, Henri Prade, Steven Schockaert(+2 more)

Abstract:This paper proposes a tentative and original survey of meeting points between Knowledge Representation and Reasoning (KRR) and Machine Learning (ML), two areas which have been developing quite separately in the last three decades. Some common concerns are identified and discussed such as the types of used representation, the roles of knowledge and data, the lack or the excess of information, or the need for explanations and causal understanding. Then some methodologies combining reasoning and learning are reviewed (such as inductive logic programming, neuro-symbolic reasoning, formal concept analysis, rule-based representations and ML, uncertainty in ML, or case-based reasoning and analogical reasoning), before discussing examples of synergies between KRR and ML (including topics such as belief functions on regression, EM algorithm versus revision, the semantic description of vector representations, the combination of deep learning with high level inference, knowledge graph completion, declarative frameworks for data mining, or preferences and recommendation). This paper is the first step of a work in progress aiming at a better mutual understanding of research in KRR and ML, and how they could cooperate.

* 53 pages

Via

Access Paper or Ask Questions

Epistemic Uncertainty Sampling

Aug 31, 2019

Vu-Linh Nguyen, Sébastien Destercke, Eyke Hüllermeier

Figure 1 for Epistemic Uncertainty Sampling

Figure 2 for Epistemic Uncertainty Sampling

Figure 3 for Epistemic Uncertainty Sampling

Figure 4 for Epistemic Uncertainty Sampling

Abstract:Various strategies for active learning have been proposed in the machine learning literature. In uncertainty sampling, which is among the most popular approaches, the active learner sequentially queries the label of those instances for which its current prediction is maximally uncertain. The predictions as well as the measures used to quantify the degree of uncertainty, such as entropy, are almost exclusively of a probabilistic nature. In this paper, we advocate a distinction between two different types of uncertainty, referred to as epistemic and aleatoric, in the context of active learning. Roughly speaking, these notions capture the reducible and the irreducible part of the total uncertainty in a prediction, respectively. We conjecture that, in uncertainty sampling, the usefulness of an instance is better reflected by its epistemic than by its aleatoric uncertainty. This leads us to suggest the principle of "epistemic uncertainty sampling", which we instantiate by means of a concrete approach for measuring epistemic and aleatoric uncertainty. In experimental studies, epistemic uncertainty sampling does indeed show promising performance.

* Draft version of a paper to be published in the proceedings of DS 2019, 22nd International Conference on Discovery Science, Split, Croatia, 2019

Via

Access Paper or Ask Questions

Reasons and Means to Model Preferences as Incomplete

Jan 05, 2018

Olivier Cailloux, Sébastien Destercke

Abstract:Literature involving preferences of artificial agents or human beings often assume their preferences can be represented using a complete transitive binary relation. Much has been written however on different models of preferences. We review some of the reasons that have been put forward to justify more complex modeling, and review some of the techniques that have been proposed to obtain models of such preferences.

* 11th International Conference on Scalable Uncertainty Management, 2017

Via

Access Paper or Ask Questions

Building an interpretable fuzzy rule base from data using Orthogonal Least Squares Application to a depollution problem

Aug 21, 2008

Sébastien Destercke, Serge Guillaume, Brigitte Charnomordic

Figure 1 for Building an interpretable fuzzy rule base from data using Orthogonal Least Squares Application to a depollution problem

Figure 2 for Building an interpretable fuzzy rule base from data using Orthogonal Least Squares Application to a depollution problem

Figure 3 for Building an interpretable fuzzy rule base from data using Orthogonal Least Squares Application to a depollution problem

Figure 4 for Building an interpretable fuzzy rule base from data using Orthogonal Least Squares Application to a depollution problem

Abstract:In many fields where human understanding plays a crucial role, such as bioprocesses, the capacity of extracting knowledge from data is of critical importance. Within this framework, fuzzy learning methods, if properly used, can greatly help human experts. Amongst these methods, the aim of orthogonal transformations, which have been proven to be mathematically robust, is to build rules from a set of training data and to select the most important ones by linear regression or rank revealing techniques. The OLS algorithm is a good representative of those methods. However, it was originally designed so that it only cared about numerical performance. Thus, we propose some modifications of the original method to take interpretability into account. After recalling the original algorithm, this paper presents the changes made to the original method, then discusses some results obtained from benchmark problems. Finally, the algorithm is applied to a real-world fault detection depollution problem.

* Fuzzy Sets and Systems 158, 18 (2007) 2078-2094
* pre-print of final version published in Fuzzy Sets and Systems

Via

Access Paper or Ask Questions