LACODAM, IRISA
Abstract: With the introduction of (large) language models, there has been significant concern about the unintended bias such models may inherit from their training data. A number of studies have shown that such models propagate gender stereotypes, as well as geographical and racial bias, among others. While existing works tackle this issue by preprocessing data and debiasing embeddings, the proposed methods require considerable computational resources and annotation effort while being limited to certain types of biases. To address these issues, we introduce REFINE-LM, a debiasing method that uses reinforcement learning to handle different types of biases without any fine-tuning. By training a simple model on top of the word probability distribution of an LM, our bias-agnostic reinforcement learning method enables model debiasing without human annotations or significant computational resources. Experiments conducted on a wide range of models, including several LMs, show that our method (i) significantly reduces stereotypical biases while preserving LM performance; (ii) is applicable to different types of biases, generalizing across contexts such as gender, ethnicity, religion, and nationality-based biases; and (iii) is inexpensive to train.
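As a rough picture of the idea of training a small layer on top of a frozen LM's answer distribution with reinforcement learning, the sketch below re-weights the probabilities of two counterpart answers to an underspecified cloze query and updates the head with REINFORCE. The toy LM output, the reward, and all names (e.g., lm_probs, theta) are illustrative assumptions, not the actual REFINE-LM implementation.

```python
# Minimal sketch, NOT the REFINE-LM implementation: a tiny head re-weights the
# frozen LM's probabilities over two counterpart answers (e.g., "he"/"she" for
# an underspecified cloze query) and is trained with REINFORCE. The LM output,
# the reward, and the hyperparameters are all illustrative.
import numpy as np

rng = np.random.default_rng(0)

def softmax(z):
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def lm_probs():
    """Stand-in for the frozen LM: a biased distribution over two counterparts."""
    return np.array([0.70, 0.30])

theta = np.zeros(2)      # parameters of the debiasing head (the only thing trained)
lr = 0.1

for _ in range(5000):
    p = softmax(np.log(lm_probs()) + theta)    # head re-weights the LM distribution
    a = rng.choice(2, p=p)                     # sample an answer (the RL "action")
    reward = -np.log(p[a])                     # reward the currently under-predicted answer
    theta += lr * reward * (np.eye(2)[a] - p)  # REINFORCE update; LM weights untouched

print(softmax(np.log(lm_probs()) + theta))     # close to [0.5, 0.5] after training
```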
Abstract: Although counterfactual explanations are a popular approach to explain ML black-box classifiers, they are less widespread in NLP. Most methods find such explanations by iteratively perturbing the target document until it is classified differently by the black box. We identify two main families of counterfactual explanation methods in the literature, namely, (a) \emph{transparent} methods that perturb the target by adding, removing, or replacing words, and (b) \emph{opaque} approaches that project the target document into a latent, non-interpretable space where the perturbation is then carried out. This article offers a comparative study of the performance of these two families of methods on three classical NLP tasks. Our empirical evidence shows that opaque approaches can be overkill for downstream applications such as fake news detection or sentiment analysis, since they add an additional level of complexity with no significant performance gain. These observations motivate our discussion, which raises the question of whether it makes sense to explain a black box using another black box.
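As a concrete, if simplified, picture of the "transparent" family, the snippet below greedily replaces or removes one word at a time until a toy black-box classifier changes its prediction. The classifier, the substitution dictionary, and the single-edit search are assumptions made for illustration; actual methods in this family are considerably more elaborate.

```python
# Illustrative sketch of a "transparent" counterfactual search: perturb the
# target document word by word until the black-box label flips. The toy
# classifier and the candidate substitutions are placeholders.
def black_box(text):
    """Toy sentiment classifier standing in for any black box."""
    negatives = {"boring", "awful", "bad"}
    return "negative" if any(w in negatives for w in text.split()) else "positive"

SUBSTITUTES = {"boring": ["gripping", "fine"], "awful": ["decent"], "bad": ["good"]}

def counterfactual(text):
    original = black_box(text)
    words = text.split()
    for i, w in enumerate(words):
        for candidate in SUBSTITUTES.get(w, []) + [""]:   # replace or delete word i
            kept = [candidate] if candidate else []
            perturbed = " ".join(words[:i] + kept + words[i + 1:])
            if black_box(perturbed) != original:
                return perturbed                          # a one-word edit flips the label
    return None                                           # would need multi-word edits

print(counterfactual("the plot was boring and predictable"))
```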
Abstract: Machine learning techniques, such as deep learning and ensemble methods, are widely used in various domains due to their ability to handle complex real-world tasks. However, their black-box nature has raised multiple concerns about the fairness, trustworthiness, and transparency of computer-assisted decision-making. This has led to the emergence of local post-hoc explainability methods, which offer explanations for individual decisions made by black-box algorithms. Among these methods, Kernel SHAP is widely used due to its model-agnostic nature and its well-founded theoretical framework. Despite these strengths, Kernel SHAP suffers from high instability: different executions of the method with the same inputs can lead to significantly different explanations, which diminishes the utility of post-hoc explainability. The contribution of this paper is twofold. On the one hand, we show that Kernel SHAP's instability is caused by its stochastic neighbor selection procedure, which we adapt to achieve full stability without compromising explanation fidelity. On the other hand, we show that by restricting the neighbor generation to perturbations of size 1 -- which we call the coalitions of Layer 1 -- we obtain a novel feature-attribution method that is fully stable, efficient to compute, and still meaningful.
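To give an intuition of what restricting the neighborhood to size-1 perturbations looks like, here is a simplified, fully deterministic attribution sketch: each feature's score is the change in the model output when only that feature is reverted to a baseline value. The toy model, the baseline, and the exact scoring are assumptions and not the paper's estimator.

```python
# Simplified illustration of the "Layer 1" idea: the neighborhood contains only
# coalitions that toggle a single feature with respect to the instance, which
# makes the resulting attribution deterministic. The black box and the baseline
# are toy placeholders.
import numpy as np

def model(X):
    """Toy black box: a fixed linear model with one interaction term."""
    return 2.0 * X[:, 0] - 1.0 * X[:, 1] + 0.5 * X[:, 0] * X[:, 2]

def layer1_attributions(x, baseline):
    """Score of feature i = change in output when only i is reverted to the baseline."""
    f_x = model(x[None, :])[0]
    scores = np.zeros_like(x, dtype=float)
    for i in range(len(x)):
        z = x.copy()
        z[i] = baseline[i]                  # size-1 perturbation: mask feature i only
        scores[i] = f_x - model(z[None, :])[0]
    return scores

x = np.array([1.0, 2.0, 3.0])
baseline = np.zeros(3)                      # e.g., a dataset mean in practice
print(layer1_attributions(x, baseline))     # deterministic: identical on every run
```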
Abstract: Knowledge graphs (KGs) are key tools in many AI-related tasks such as reasoning or question answering. This has, in turn, propelled research in link prediction in KGs, the task of predicting missing relationships from the available knowledge. Solutions based on KG embeddings have shown promising results in this matter. On the downside, these approaches are usually unable to explain their predictions. While some works have proposed to compute post-hoc rule explanations for embedding-based link predictors, these efforts have mostly resorted to rules with unbounded atoms, e.g., bornIn(x,y) => residence(x,y), learned on a global scope, i.e., the entire KG. None of these works has considered the impact of rules with bounded atoms such as nationality(x,England) => speaks(x,English), or the impact of learning from regions of the KG, i.e., local scopes. We therefore study the effects of these factors on the quality of rule-based explanations for embedding-based link predictors. Our results suggest that more specific rules and local scopes can improve the accuracy of the explanations. Moreover, these rules can provide further insights into the inner workings of KG embeddings for link prediction.
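The snippet below illustrates, on an invented set of triples, how the support and standard confidence of a rule with bounded atoms such as nationality(x,England) => speaks(x,English) can be computed. The toy KG and the quality measure are illustrative assumptions; they only serve to make the notion of a bounded-atom rule concrete.

```python
# Hedged sketch: evaluating a rule with bounded atoms on a toy set of triples.
# Neither the data nor the measure corresponds to the paper's experimental setup.
KG = {
    ("alice", "nationality", "England"), ("alice", "speaks", "English"),
    ("bob", "nationality", "England"),   ("bob", "speaks", "English"),
    ("carl", "nationality", "England"),  # missing 'speaks' fact: a prediction target
    ("dana", "nationality", "France"),   ("dana", "speaks", "French"),
}

def rule_stats(body, head):
    """Support and standard confidence of body(x, c1) => head(x, c2)."""
    b_rel, b_obj = body
    h_rel, h_obj = head
    matches = {s for (s, p, o) in KG if p == b_rel and o == b_obj}
    support = sum(1 for s in matches if (s, h_rel, h_obj) in KG)
    confidence = support / len(matches) if matches else 0.0
    return support, confidence

# nationality(x,England) => speaks(x,English): support 2, confidence 2/3
print(rule_stats(("nationality", "England"), ("speaks", "English")))
```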
Abstract: The benefit of locality is one of the major premises of LIME, one of the most prominent methods to explain black-box machine learning models. This emphasis relies on the postulate that the more locally we look at the vicinity of an instance, the simpler the black-box model becomes, and the more accurately we can mimic it with a linear surrogate. As logical as this seems, our findings suggest that, with the current design of LIME, the surrogate model may degenerate when the explanation is too local, namely, when the bandwidth parameter $\sigma$ tends to zero. Based on this observation, the contribution of this paper is twofold. Firstly, we study the impact of both the bandwidth and the training vicinity on the fidelity and semantics of LIME explanations. Secondly, based on our findings, we propose s-LIME, an extension of LIME that reconciles fidelity and locality.
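The degeneracy mentioned above can be seen directly from LIME's exponential kernel: as the bandwidth $\sigma$ shrinks, the weights of the sampled neighbors collapse to zero and the weighted linear fit becomes ill-conditioned. The distances below are synthetic and the snippet is only meant to illustrate the trend, not to reproduce the paper's experiments.

```python
# Illustration of the weight collapse as sigma tends to zero (synthetic distances).
import numpy as np

distances = np.array([0.2, 0.5, 0.9, 1.3])        # distances of sampled neighbors to x

for sigma in (1.0, 0.1, 0.01):
    weights = np.exp(-distances**2 / sigma**2)    # LIME's kernel: exp(-d^2 / sigma^2)
    print(sigma, weights)
# sigma=1.0 : all neighbors keep sizeable weights (~0.96, 0.78, 0.44, 0.18)
# sigma=0.1 : only the closest neighbor keeps a non-negligible weight (~0.02)
# sigma=0.01: every weight underflows to 0, so the surrogate fit degenerates
```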
Abstract: We are interested in understanding the underlying generation process for long sequences of symbolic events. To do so, we propose COSSU, an algorithm to mine small and meaningful sets of sequential rules. The rules are selected using an MDL-inspired criterion that favors compactness and relies on a novel rule-based encoding scheme for sequences. Our evaluation shows that COSSU can successfully retrieve relevant sets of closed sequential rules from a long sequence. Such rules constitute an interpretable model that exhibits competitive accuracy for the tasks of next-element prediction and classification.
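As an illustration of how a compact set of sequential rules can be used once mined, the sketch below applies hand-written rules for next-element prediction by matching rule bodies against the suffix of the observed history. Both the rules and the suffix-matching simplification are assumptions for illustration; this is not COSSU's mining procedure or its exact prediction scheme.

```python
# Illustrative sketch (not COSSU itself): applying a small set of sequential
# rules for next-element prediction on a symbolic sequence. Rules and the
# "longest body first" matching strategy are placeholders.
RULES = [                       # body (ordered pattern) => predicted next symbol
    (("a", "b", "c"), "d"),
    (("b", "c"), "e"),
    (("c",), "a"),
]

def predict_next(history):
    for body, head in sorted(RULES, key=lambda r: -len(r[0])):   # prefer specific rules
        if tuple(history[-len(body):]) == body:                  # body matches the suffix
            return head
    return None

print(predict_next(["x", "a", "b", "c"]))   # 'd'
print(predict_next(["x", "b", "c"]))        # 'e'
```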
Abstract: We introduce HiPaR, a novel pattern-aided regression method for tabular data containing both categorical and numerical attributes. HiPaR mines hybrid rules of the form $p \Rightarrow y = f(X)$ where $p$ is the characterization of a data region and $f(X)$ is a linear regression model on a variable of interest $y$. HiPaR relies on pattern mining techniques to identify regions of the data where the target variable can be accurately explained via local linear models. The novelty of the method lies in the combination of an enumerative approach to explore the space of regions and efficient heuristics that guide the search. Such a strategy provides more flexibility when selecting a small set of jointly accurate and human-readable hybrid rules that explain the entire dataset. As our experiments show, HiPaR mines fewer rules than existing pattern-based regression methods while still attaining state-of-the-art prediction performance.
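The following sketch shows how hybrid rules of the form $p \Rightarrow y = f(X)$ can be used at prediction time: the pattern $p$ selects a data region and the attached linear model produces the estimate. The rules, coefficients, and fallback model are invented for illustration; HiPaR learns such rules from the data rather than taking them as input.

```python
# Hedged sketch of applying hybrid rules p => y = f(X): the pattern routes an
# instance to a local linear model. All rules and coefficients are invented.
RULES = [
    # (pattern over categorical attributes, intercept, coefficients over numeric attributes)
    ({"city": "Paris"},  50.0, {"surface": 12.0}),   # city=Paris  => price = 50 + 12*surface
    ({"city": "Rennes"}, 20.0, {"surface": 7.5}),    # city=Rennes => price = 20 + 7.5*surface
]
DEFAULT = (30.0, {"surface": 9.0})                    # fallback linear model

def predict(row):
    for pattern, intercept, coefs in RULES:
        if all(row.get(k) == v for k, v in pattern.items()):
            return intercept + sum(c * row[a] for a, c in coefs.items())
    intercept, coefs = DEFAULT
    return intercept + sum(c * row[a] for a, c in coefs.items())

print(predict({"city": "Paris", "surface": 30}))   # 50 + 12*30 = 410.0
print(predict({"city": "Brest", "surface": 30}))   # fallback: 30 + 9*30 = 300.0
```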
Abstract: A referring expression (RE) is a description that unambiguously identifies a set of instances. Mining REs from data finds applications in natural language generation, algorithmic journalism, and data maintenance. Since there may exist multiple REs for a given set of entities, it is common to focus on the most intuitive ones, i.e., the most concise and informative. In this paper, we present REMI, a system that can mine intuitive REs on large RDF knowledge bases. Our experimental evaluation shows that REMI finds REs deemed intuitive by users. Moreover, we show that REMI is several orders of magnitude faster than an approach based on inductive logic programming.
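For intuition, the snippet below searches, over a toy RDF-like dataset, for the smallest conjunction of property-value pairs whose extension is exactly a target set of entities. The data and the brute-force enumeration are illustrative assumptions and not REMI's mining algorithm, which is designed to scale to large knowledge bases.

```python
# Illustrative sketch (not REMI): find the most concise conjunction of
# property-value pairs that unambiguously identifies the target entities.
from itertools import combinations

FACTS = {
    "q1": {("type", "Scientist"), ("field", "Physics"), ("century", "20th")},
    "q2": {("type", "Scientist"), ("field", "Physics"), ("century", "19th")},
    "q3": {("type", "Scientist"), ("field", "Biology"), ("century", "20th")},
}

def extension(conjunction):
    """Entities that satisfy every property-value pair of the conjunction."""
    return {e for e, props in FACTS.items() if set(conjunction) <= props}

def mine_re(target):
    candidates = set().union(*(FACTS[e] for e in target))
    for size in range(1, len(candidates) + 1):          # most concise first
        for conj in combinations(sorted(candidates), size):
            if extension(conj) == target:
                return conj                              # unambiguous referring expression
    return None

print(mine_re({"q1"}))   # (('century', '20th'), ('field', 'Physics'))
```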