Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Xavier Renard

Post-processing fairness with minimal changes

Aug 27, 2024

Federico Di Gennaro, Thibault Laugel, Vincent Grari, Xavier Renard, Marcin Detyniecki

Abstract:In this paper, we introduce a novel post-processing algorithm that is both model-agnostic and does not require the sensitive attribute at test time. In addition, our algorithm is explicitly designed to enforce minimal changes between biased and debiased predictions; a property that, while highly desirable, is rarely prioritized as an explicit objective in fairness literature. Our approach leverages a multiplicative factor applied to the logit value of probability scores produced by a black-box classifier. We demonstrate the efficacy of our method through empirical evaluations, comparing its performance against other four debiasing algorithms on two widely used datasets in fairness research.

Via

Access Paper or Ask Questions

Dynamic Interpretability for Model Comparison via Decision Rules

Sep 29, 2023

Adam Rida, Marie-Jeanne Lesot, Xavier Renard, Christophe Marsala

Abstract:Explainable AI (XAI) methods have mostly been built to investigate and shed light on single machine learning models and are not designed to capture and explain differences between multiple models effectively. This paper addresses the challenge of understanding and explaining differences between machine learning models, which is crucial for model selection, monitoring and lifecycle management in real-world applications. We propose DeltaXplainer, a model-agnostic method for generating rule-based explanations describing the differences between two binary classifiers. To assess the effectiveness of DeltaXplainer, we conduct experiments on synthetic and real-world datasets, covering various model comparison scenarios involving different types of concept drift.

Via

Access Paper or Ask Questions

How to choose an Explainability Method? Towards a Methodical Implementation of XAI in Practice

Jul 09, 2021

Tom Vermeire, Thibault Laugel, Xavier Renard, David Martens, Marcin Detyniecki

Figure 1 for How to choose an Explainability Method? Towards a Methodical Implementation of XAI in Practice

Abstract:Explainability is becoming an important requirement for organizations that make use of automated decision-making due to regulatory initiatives and a shift in public awareness. Various and significantly different algorithmic methods to provide this explainability have been introduced in the field, but the existing literature in the machine learning community has paid little attention to the stakeholder whose needs are rather studied in the human-computer interface community. Therefore, organizations that want or need to provide this explainability are confronted with the selection of an appropriate method for their use case. In this paper, we argue there is a need for a methodology to bridge the gap between stakeholder needs and explanation methods. We present our ongoing work on creating this methodology to help data scientists in the process of providing explainability to stakeholders. In particular, our contributions include documents used to characterize XAI methods and user requirements (shown in Appendix), which our methodology builds upon.

Via

Access Paper or Ask Questions

Understanding surrogate explanations: the interplay between complexity, fidelity and coverage

Jul 09, 2021

Rafael Poyiadzi, Xavier Renard, Thibault Laugel, Raul Santos-Rodriguez, Marcin Detyniecki

Figure 1 for Understanding surrogate explanations: the interplay between complexity, fidelity and coverage

Figure 2 for Understanding surrogate explanations: the interplay between complexity, fidelity and coverage

Figure 3 for Understanding surrogate explanations: the interplay between complexity, fidelity and coverage

Figure 4 for Understanding surrogate explanations: the interplay between complexity, fidelity and coverage

Abstract:This paper analyses the fundamental ingredients behind surrogate explanations to provide a better understanding of their inner workings. We start our exposition by considering global surrogates, describing the trade-off between complexity of the surrogate and fidelity to the black-box being modelled. We show that transitioning from global to local - reducing coverage - allows for more favourable conditions on the Pareto frontier of fidelity-complexity of a surrogate. We discuss the interplay between complexity, fidelity and coverage, and consider how different user needs can lead to problem formulations where these are either constraints or penalties. We also present experiments that demonstrate how the local surrogate interpretability procedure can be made interactive and lead to better explanations.

* 12 pages, 8 figures

Via

Access Paper or Ask Questions

On the overlooked issue of defining explanation objectives for local-surrogate explainers

Jun 10, 2021

Rafael Poyiadzi, Xavier Renard, Thibault Laugel, Raul Santos-Rodriguez, Marcin Detyniecki

Figure 1 for On the overlooked issue of defining explanation objectives for local-surrogate explainers

Figure 2 for On the overlooked issue of defining explanation objectives for local-surrogate explainers

Figure 3 for On the overlooked issue of defining explanation objectives for local-surrogate explainers

Abstract:Local surrogate approaches for explaining machine learning model predictions have appealing properties, such as being model-agnostic and flexible in their modelling. Several methods exist that fit this description and share this goal. However, despite their shared overall procedure, they set out different objectives, extract different information from the black-box, and consequently produce diverse explanations, that are -- in general -- incomparable. In this work we review the similarities and differences amongst multiple methods, with a particular focus on what information they extract from the model, as this has large impact on the output: the explanation. We discuss the implications of the lack of agreement, and clarity, amongst the methods' objectives on the research and practice of explainability.

Via

Access Paper or Ask Questions

Understanding Prediction Discrepancies in Machine Learning Classifiers

Apr 12, 2021

Xavier Renard, Thibault Laugel, Marcin Detyniecki

Figure 1 for Understanding Prediction Discrepancies in Machine Learning Classifiers

Figure 2 for Understanding Prediction Discrepancies in Machine Learning Classifiers

Figure 3 for Understanding Prediction Discrepancies in Machine Learning Classifiers

Figure 4 for Understanding Prediction Discrepancies in Machine Learning Classifiers

Abstract:A multitude of classifiers can be trained on the same data to achieve similar performances during test time, while having learned significantly different classification patterns. This phenomenon, which we call prediction discrepancies, is often associated with the blind selection of one model instead of another with similar performances. When making a choice, the machine learning practitioner has no understanding on the differences between models, their limits, where they agree and where they don't. But his/her choice will result in concrete consequences for instances to be classified in the discrepancy zone, since the final decision will be based on the selected classification pattern. Besides the arbitrary nature of the result, a bad choice could have further negative consequences such as loss of opportunity or lack of fairness. This paper proposes to address this question by analyzing the prediction discrepancies in a pool of best-performing models trained on the same data. A model-agnostic algorithm, DIG, is proposed to capture and explain discrepancies locally, to enable the practitioner to make the best educated decision when selecting a model by anticipating its potential undesired consequences. All the code to reproduce the experiments is available.

Via

Access Paper or Ask Questions

QUACKIE: A NLP Classification Task With Ground Truth Explanations

Dec 27, 2020

Yves Rychener, Xavier Renard, Djamé Seddah, Pascal Frossard, Marcin Detyniecki

Figure 1 for QUACKIE: A NLP Classification Task With Ground Truth Explanations

Figure 2 for QUACKIE: A NLP Classification Task With Ground Truth Explanations

Figure 3 for QUACKIE: A NLP Classification Task With Ground Truth Explanations

Figure 4 for QUACKIE: A NLP Classification Task With Ground Truth Explanations

Abstract:NLP Interpretability aims to increase trust in model predictions. This makes evaluating interpretability approaches a pressing issue. There are multiple datasets for evaluating NLP Interpretability, but their dependence on human provided ground truths raises questions about their unbiasedness. In this work, we take a different approach and formulate a specific classification task by diverting question-answering datasets. For this custom classification task, the interpretability ground-truth arises directly from the definition of the classification problem. We use this method to propose a benchmark and lay the groundwork for future research in NLP interpretability by evaluating a wide range of current state of the art methods.

Via

Access Paper or Ask Questions

Sentence-Based Model Agnostic NLP Interpretability

Dec 27, 2020

Yves Rychener, Xavier Renard, Djamé Seddah, Pascal Frossard, Marcin Detyniecki

Figure 1 for Sentence-Based Model Agnostic NLP Interpretability

Figure 2 for Sentence-Based Model Agnostic NLP Interpretability

Figure 3 for Sentence-Based Model Agnostic NLP Interpretability

Figure 4 for Sentence-Based Model Agnostic NLP Interpretability

Abstract:Today, interpretability of Black-Box Natural Language Processing (NLP) models based on surrogates, like LIME or SHAP, uses word-based sampling to build the explanations. In this paper we explore the use of sentences to tackle NLP interpretability. While this choice may seem straight forward, we show that, when using complex classifiers like BERT, the word-based approach raises issues not only of computational complexity, but also of an out of distribution sampling, eventually leading to non founded explanations. By using sentences, the altered text remains in-distribution and the dimensionality of the problem is reduced for better fidelity to the black-box at comparable computational complexity.

Via

Access Paper or Ask Questions

Imperceptible Adversarial Attacks on Tabular Data

Dec 13, 2019

Vincent Ballet, Xavier Renard, Jonathan Aigrain, Thibault Laugel, Pascal Frossard, Marcin Detyniecki

Figure 1 for Imperceptible Adversarial Attacks on Tabular Data

Figure 2 for Imperceptible Adversarial Attacks on Tabular Data

Figure 3 for Imperceptible Adversarial Attacks on Tabular Data

Figure 4 for Imperceptible Adversarial Attacks on Tabular Data

Abstract:Security of machine learning models is a concern as they may face adversarial attacks for unwarranted advantageous decisions. While research on the topic has mainly been focusing on the image domain, numerous industrial applications, in particular in finance, rely on standard tabular data. In this paper, we discuss the notion of adversarial examples in the tabular domain. We propose a formalization based on the imperceptibility of attacks in the tabular domain leading to an approach to generate imperceptible adversarial examples. Experiments show that we can generate imperceptible adversarial examples with a high fooling rate.

* presented at NeurIPS 2019 Workshop on Robust AI in Financial Services: Data, Fairness, Explainability, Trustworthiness, and Privacy (Robust AI in FS 2019), Vancouver, Canada

Via

Access Paper or Ask Questions

The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Jul 22, 2019

Thibault Laugel, Marie-Jeanne Lesot, Christophe Marsala, Xavier Renard, Marcin Detyniecki

Figure 1 for The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Figure 2 for The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Figure 3 for The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Figure 4 for The Dangers of Post-hoc Interpretability: Unjustified Counterfactual Explanations

Abstract:Post-hoc interpretability approaches have been proven to be powerful tools to generate explanations for the predictions made by a trained black-box model. However, they create the risk of having explanations that are a result of some artifacts learned by the model instead of actual knowledge from the data. This paper focuses on the case of counterfactual explanations and asks whether the generated instances can be justified, i.e. continuously connected to some ground-truth data. We evaluate the risk of generating unjustified counterfactual examples by investigating the local neighborhoods of instances whose predictions are to be explained and show that this risk is quite high for several datasets. Furthermore, we show that most state of the art approaches do not differentiate justified from unjustified counterfactual examples, leading to less useful explanations.

Via

Access Paper or Ask Questions