Christophe Marsala

A Robust Autoencoder Ensemble-Based Approach for Anomaly Detection in Text

May 16, 2024

Jeremie Pantin, Christophe Marsala

Abstract: In this work, a robust autoencoder ensemble-based approach designed to address anomaly detection in text corpora is introduced. Each autoencoder within the ensemble incorporates a local robust subspace recovery projection of the original data in its encoding embedding, leveraging the geometric properties of the k-nearest neighbors to optimize subspace recovery and identify anomalous patterns in textual data. The evaluation of such an approach needs an experimental setting dedicated to the context of textual anomaly detection. Thus, beforehand, a comprehensive real-world taxonomy is introduced to distinguish between independent anomalies and contextual anomalies. Such a study, which clearly identifies the kinds of anomalies appearing in a textual context, aims to address a critical gap in the existing literature. Then, extensive experiments on classical text corpora have been conducted, and their results highlight the efficiency, in both robustness and performance, of the robust autoencoder ensemble-based approach when detecting both independent and contextual anomalies. A diverse range of tasks, including classification, sentiment analysis, and spam detection, across eight different corpora, has been studied in these experiments.

* Submitted to ECML/PKDD 2024

Dynamic Interpretability for Model Comparison via Decision Rules

Sep 29, 2023

Adam Rida, Marie-Jeanne Lesot, Xavier Renard, Christophe Marsala

Abstract: Explainable AI (XAI) methods have mostly been built to investigate and shed light on single machine learning models and are not designed to capture and explain differences between multiple models effectively. This paper addresses the challenge of understanding and explaining differences between machine learning models, which is crucial for model selection, monitoring and lifecycle management in real-world applications. We propose DeltaXplainer, a model-agnostic method for generating rule-based explanations describing the differences between two binary classifiers. To assess the effectiveness of DeltaXplainer, we conduct experiments on synthetic and real-world datasets, covering various model comparison scenarios involving different types of concept drift.

Achieving Diversity in Counterfactual Explanations: a Review and Discussion

May 10, 2023

Thibault Laugel, Adulam Jeyasothy, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki

Abstract: In the field of Explainable Artificial Intelligence (XAI), counterfactual examples explain to a user the predictions of a trained decision model by indicating the modifications to be made to the instance so as to change its associated prediction. These counterfactual examples are generally defined as solutions to an optimization problem whose cost function combines several criteria that quantify desiderata for a good explanation meeting user needs. A large variety of such appropriate properties can be considered, as the user needs are generally unknown and differ from one user to another; their selection and formalization are difficult. To circumvent this issue, several approaches propose to generate, rather than a single one, a set of diverse counterfactual examples to explain a prediction. This paper proposes a review of the numerous, sometimes conflicting, definitions that have been proposed for this notion of diversity. It discusses their underlying principles as well as the hypotheses on the user needs they rely on and proposes to categorize them along several dimensions (explicit vs implicit, universe in which they are defined, level at which they apply), leading to the identification of further research challenges on this topic.

Integrating Prior Knowledge in Post-hoc Explanations

Apr 25, 2022

Adulam Jeyasothy, Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki

Abstract: In the field of eXplainable Artificial Intelligence (XAI), post-hoc interpretability methods aim at explaining to a user the predictions of a trained decision model. Integrating prior knowledge into such interpretability methods aims at improving the explanation understandability and allowing for personalised explanations adapted to each user. In this paper, we propose to define a cost function that explicitly integrates prior knowledge into the interpretability objectives: we present a general framework for the optimization problem of post-hoc interpretability methods, and show that user knowledge can thus be integrated into any method by adding a compatibility term in the cost function. We instantiate the proposed formalization in the case of counterfactual explanations and propose a new interpretability method called Knowledge Integration in Counterfactual Explanation (KICE) to optimize it. The paper performs an experimental study on several benchmark data sets to characterize the counterfactual instances generated by KICE, as compared to reference methods.
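
The idea of adding a compatibility term to the cost function can be written schematically as follows. This is only an illustrative sketch: the symbols $\mathcal{E}$ (set of candidate explanations), $\mathrm{penalty}_x$, $\mathrm{incompatibility}_x$, $K$ (user knowledge) and the trade-off weight $\lambda$ are notational assumptions made here, not necessarily the notation used in the paper.

```latex
% Generic post-hoc explanation of the prediction for an instance x (assumed notation)
e^{*} = \operatorname*{arg\,min}_{e \in \mathcal{E}} \; \mathrm{penalty}_{x}(e)

% Same problem with an added term measuring incompatibility with user knowledge K
e^{*}_{K} = \operatorname*{arg\,min}_{e \in \mathcal{E}} \;
    \mathrm{penalty}_{x}(e) + \lambda \, \mathrm{incompatibility}_{x}(e, K),
    \qquad \lambda \ge 0
```

With $\lambda = 0$ the second problem reduces to the first, so in this reading the knowledge term acts as an optional add-on to an existing method.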

* preprint

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Jul 22, 2019

Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Xavier Renard, Marcin Detyniecki

Abstract: Post-hoc interpretability approaches have been proven to be powerful tools to generate explanations for the predictions made by a trained black-box model. However, they create the risk of having explanations that are a result of some artifacts learned by the model instead of actual knowledge from the data. This paper focuses on the case of counterfactual explanations and asks whether the generated instances can be justified, i.e. continuously connected to some ground-truth data. We evaluate the risk of generating unjustified counterfactual examples by investigating the local neighborhoods of instances whose predictions are to be explained and show that this risk is quite high for several datasets. Furthermore, we show that most state-of-the-art approaches do not differentiate justified from unjustified counterfactual examples, leading to less useful explanations.

Issues with post-hoc counterfactual explanations: a discussion

Jun 11, 2019

Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki

Abstract: Counterfactual post-hoc interpretability approaches have been proven to be useful tools to generate explanations for the predictions of a trained black-box classifier. However, the assumptions they make about the data and the classifier make them unreliable in many contexts. In this paper, we discuss three desirable properties and approaches to quantify them: proximity, connectedness and stability. In addition, we illustrate that there is a risk for post-hoc counterfactual approaches to not satisfy these properties.

* presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA

Detecting Potential Local Adversarial Examples for Human-Interpretable Defense

Sep 07, 2018

Xavier Renard, Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki

Abstract: Machine learning models are increasingly used in industry to make decisions such as credit insurance approval. Some people may be tempted to manipulate specific variables, such as age or salary, in order to get better chances of approval. In this ongoing work, we propose to discuss, with a first proposition, the issue of detecting a potential local adversarial example on classical tabular data by providing to a human expert the locally critical features for the classifier's decision, in order to control the provided information and avoid fraud.

* presented at 2018 ECML/PKDD Workshop on Recent Advances in Adversarial Machine Learning (Nemesis 2018), Dublin, Ireland

Defining Locality for Surrogates in Post-hoc Interpretablity

Jun 19, 2018

Thibault Laugel, Xavier Renard, Marie-Jeanne Lesot, Christophe Marsala, Marcin Detyniecki

Abstract: Local surrogate models, which approximate the local decision boundary of a black-box classifier, constitute one approach to generate explanations for the rationale behind an individual prediction made by the black-box. This paper highlights the importance of defining the right locality, the neighborhood on which a local surrogate is trained, in order to approximate accurately the local black-box decision boundary. Unfortunately, as shown in this paper, this issue is not only a parameter or sampling distribution challenge and has a major impact on the relevance and quality of the approximation of the local black-box decision boundary and thus on the meaning and accuracy of the generated explanation. To overcome the identified problems, quantified with an adapted measure and procedure, we propose to generate surrogate-based explanations for individual predictions based on a sampling centered on a particular place of the decision boundary, relevant for the prediction to be explained, rather than on the prediction itself as it is classically done. We evaluate the novel approach compared to state-of-the-art methods and a straightforward improvement thereof on four UCI datasets.

* presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden

Inverse Classification for Comparison-based Interpretability in Machine Learning

Dec 22, 2017

Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Xavier Renard, Marcin Detyniecki

Abstract: In the context of post-hoc interpretability, this paper addresses the task of explaining the prediction of a classifier, considering the case where no information is available, neither on the classifier itself, nor on the processed data (neither the training nor the test data). It proposes an instance-based approach whose principle consists in determining the minimal changes needed to alter a prediction: given a data point whose classification must be explained, the proposed method consists in identifying a close neighbour classified differently, where the closeness definition integrates a sparsity constraint. This principle is implemented using observation generation in the Growing Spheres algorithm. Experimental results on two datasets illustrate the relevance of the proposed approach that can be used to gain knowledge about the classifier.
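
The principle described above (generate observations around the point, find a close neighbour classified differently, then favour sparse changes) can be illustrated with a simplified sketch. This is not the authors' implementation: the function names, the fixed layer width `step`, the sample count, and the greedy coordinate-reset used for sparsity are all assumptions chosen for illustration, and the classifier is accessed only through a hypothetical `predict` callable.

```python
import math
import random


def sample_in_layer(center, r_min, r_max, rng):
    """Draw one point with r_min <= ||z - center|| <= r_max (uniform in volume)."""
    direction = [rng.gauss(0.0, 1.0) for _ in center]
    norm = math.sqrt(sum(d * d for d in direction)) or 1.0
    dim = len(center)
    # Inverse-CDF sampling of the radius inside the spherical layer.
    u = rng.random()
    radius = (u * (r_max ** dim - r_min ** dim) + r_min ** dim) ** (1.0 / dim)
    return [c + radius * d / norm for c, d in zip(center, direction)]


def growing_spheres_sketch(predict, x, step=0.1, n_samples=200,
                           max_layers=1000, seed=0):
    """Return a nearby point classified differently from x, or None."""
    rng = random.Random(seed)
    target = predict(x)
    r_min, enemy = 0.0, None
    # Grow spherical layers around x until one contains a differently
    # classified sample, then keep the closest such sample.
    for _ in range(max_layers):
        r_max = r_min + step
        candidates = [sample_in_layer(x, r_min, r_max, rng)
                      for _ in range(n_samples)]
        enemies = [z for z in candidates if predict(z) != target]
        if enemies:
            enemy = min(enemies, key=lambda z: math.dist(z, x))
            break
        r_min = r_max
    if enemy is None:
        return None
    # Sparsity (assumed greedy heuristic): reset coordinates to their
    # original value, smallest change first, while the class stays different.
    for i in sorted(range(len(x)), key=lambda j: abs(enemy[j] - x[j])):
        trial = list(enemy)
        trial[i] = x[i]
        if predict(trial) != target:
            enemy = trial
    return enemy
```

For example, with a toy classifier `predict = lambda z: int(z[0] > 1.0)` and `x = [0.0, 0.0]`, the returned point differs from `x` only in its first coordinate, which reads as "the prediction changes if the first feature exceeds 1".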

* preprint
