Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Antoine Saillenfest

Nonlinear Concept Erasure: a Density Matching Approach

Jul 16, 2025

Antoine Saillenfest, Pirmin Lemberger

Abstract:Ensuring that neural models used in real-world applications cannot infer sensitive information, such as demographic attributes like gender or race, from text representations is a critical challenge when fairness is a concern. We address this issue through concept erasure, a process that removes information related to a specific concept from distributed representations while preserving as much of the remaining semantic information as possible. Our approach involves learning an orthogonal projection in the embedding space, designed to make the class-conditional feature distributions of the discrete concept to erase indistinguishable after projection. By adjusting the rank of the projector, we control the extent of information removal, while its orthogonality ensures strict preservation of the local structure of the embeddings. Our method, termed $\overline{\mathrm{L}}$EOPARD, achieves state-of-the-art performance in nonlinear erasure of a discrete attribute on classic natural language processing benchmarks. Furthermore, we demonstrate that $\overline{\mathrm{L}}$EOPARD effectively mitigates bias in deep nonlinear classifiers, thereby promoting fairness.

* 17 pages, 10 figures, accepted for publication in ECAI 2025 (28th European Conference on Artificial Intelligence)

Via

Access Paper or Ask Questions

Revisiting Hierarchical Text Classification: Inference and Metrics

Oct 02, 2024

Roman Plaud, Matthieu Labeau, Antoine Saillenfest, Thomas Bonald

Figure 1 for Revisiting Hierarchical Text Classification: Inference and Metrics

Figure 2 for Revisiting Hierarchical Text Classification: Inference and Metrics

Figure 3 for Revisiting Hierarchical Text Classification: Inference and Metrics

Figure 4 for Revisiting Hierarchical Text Classification: Inference and Metrics

Abstract:Hierarchical text classification (HTC) is the task of assigning labels to a text within a structured space organized as a hierarchy. Recent works treat HTC as a conventional multilabel classification problem, therefore evaluating it as such. We instead propose to evaluate models based on specifically designed hierarchical metrics and we demonstrate the intricacy of metric choice and prediction inference method. We introduce a new challenging dataset and we evaluate fairly, recent sophisticated models, comparing them with a range of simple but strong baselines, including a new theoretically motivated loss. Finally, we show that those baselines are very often competitive with the latest models. This highlights the importance of carefully considering the evaluation methodology when proposing new methods for HTC. Code implementation and dataset are available at \url{https://github.com/RomanPlaud/revisitingHTC}.

* Accepted at CoNLL 2024

Via

Access Paper or Ask Questions

Explaining Text Classifiers with Counterfactual Representations

Feb 01, 2024

Pirmin Lemberger, Antoine Saillenfest

Figure 1 for Explaining Text Classifiers with Counterfactual Representations

Figure 2 for Explaining Text Classifiers with Counterfactual Representations

Figure 3 for Explaining Text Classifiers with Counterfactual Representations

Figure 4 for Explaining Text Classifiers with Counterfactual Representations

Abstract:One well motivated explanation method for classifiers leverages counterfactuals which are hypothetical events identical to real observations in all aspects except for one categorical feature. Constructing such counterfactual poses specific challenges for texts, however, as some attribute values may not necessarily align with plausible real-world events. In this paper we propose a simple method for generating counterfactuals by intervening in the space of text representations which bypasses this limitation. We argue that our interventions are minimally disruptive and that they are theoretically sound as they align with counterfactuals as defined in Pearl's causal inference framework. To validate our method, we first conduct experiments on a synthetic dataset of counterfactuals, allowing for a direct comparison between classifier predictions based on ground truth counterfactuals (obtained through explicit text interventions) and our counterfactuals, derived through interventions in the representation space. Second, we study a real world scenario where our counterfactuals can be leveraged both for explaining a classifier and for bias mitigation.

* 12 pages, 4 figures

Via

Access Paper or Ask Questions

Fair Evaluation of Graph Markov Neural Networks

Apr 03, 2023

Pirmin Lemberger, Antoine Saillenfest

Figure 1 for Fair Evaluation of Graph Markov Neural Networks

Figure 2 for Fair Evaluation of Graph Markov Neural Networks

Figure 3 for Fair Evaluation of Graph Markov Neural Networks

Figure 4 for Fair Evaluation of Graph Markov Neural Networks

Abstract:Graph Markov Neural Networks (GMNN) have recently been proposed to improve regular graph neural networks (GNN) by including label dependencies into the semi-supervised node classification task. GMNNs do this in a theoretically principled way and use three kinds of information to predict labels. Just like ordinary GNNs, they use the node features and the graph structure but they moreover leverage information from the labels of neighboring nodes to improve the accuracy of their predictions. In this paper, we introduce a new dataset named WikiVitals which contains a graph of 48k mutually referred Wikipedia articles classified into 32 categories and connected by 2.3M edges. Our aim is to rigorously evaluate the contributions of three distinct sources of information to the prediction accuracy of GMNN for this dataset: the content of the articles, their connections with each other and the correlations among their labels. For this purpose we adapt a method which was recently proposed for performing fair comparisons of GNN performance using an appropriate randomization over partitions and a clear separation of model selection and model assessment.

* 11 pages, 2 figures

Via

Access Paper or Ask Questions

Role of Simplicity in Creative Behaviour: The Case of the Poietic Generator

Dec 22, 2016

Antoine Saillenfest, Jean-Louis Dessalles, Olivier Auber

Figure 1 for Role of Simplicity in Creative Behaviour: The Case of the Poietic Generator

Figure 2 for Role of Simplicity in Creative Behaviour: The Case of the Poietic Generator

Figure 3 for Role of Simplicity in Creative Behaviour: The Case of the Poietic Generator

Figure 4 for Role of Simplicity in Creative Behaviour: The Case of the Poietic Generator

Abstract:We propose to apply Simplicity Theory (ST) to model interest in creative situations. ST has been designed to describe and predict interest in communication. Here we use ST to derive a decision rule that we apply to a simplified version of a creative game, the Poietic Generator. The decision rule produces what can be regarded as an elementary form of creativity. This study is meant as a proof of principle. It suggests that some creative actions may be motivated by the search for unexpected simplicity.

* Proceedings of the Seventh International Conference on Computational Creativity (ICCC-2016). Paris, France
* This study was supported by grants from the programme Futur&Ruptures and from the 'Chaire Modelisation des Imaginaires, Innovation et Creation', http://www.computationalcreativity.net/iccc2016/posters-and-demos/

Via

Access Paper or Ask Questions