Abstract:Recent research has explored the memorization capacity of multi-head attention, but these findings are constrained by unrealistic limitations on the context size. We present a novel proof for language-based Transformers that extends the current hypothesis to any context size. Our approach improves upon the state-of-the-art by achieving more effective exact memorization with an attention layer, while also introducing the concept of approximate memorization of distributions. Through experimental validation, we demonstrate that our proposed bounds more accurately reflect the true memorization capacity of language models, and provide a precise comparison with prior work.
Abstract:This paper introduces a novel evaluation framework for Large Language Models (LLMs) such as Llama-2 and Mistral, focusing on the adaptation of Precision and Recall metrics from image generation to text generation. This approach allows for a nuanced assessment of the quality and diversity of generated text without the need for aligned corpora. By conducting a comprehensive evaluation of state-of-the-art language models, the study reveals significant insights into their performance on open-ended generation tasks, which are not adequately captured by traditional benchmarks. The findings highlight a trade-off between the quality and diversity of generated samples, particularly when models are fine-tuned with human feedback. This work extends the toolkit for distribution-based NLP evaluation, offering insights into the practical capabilities and challenges faced by current LLMs in generating diverse and high-quality text.
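To make the adapted metrics concrete, the following is a minimal sketch of a common k-nearest-neighbour estimator of precision and recall on embeddings (in the spirit of Kynkäänniemi et al.); the embedding function and the exact estimator here are illustrative assumptions, not the authors' implementation.

# Sketch of k-NN-based precision/recall estimation on embeddings
# (one common estimator; the paper's exact procedure may differ).
import numpy as np

def knn_radii(X, k):
    # Distance from each point to its k-th nearest neighbour within X.
    d = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    d.sort(axis=1)
    return d[:, k]  # column 0 is the point itself (distance 0)

def coverage(queries, support, radii):
    # Fraction of query points falling inside at least one support ball.
    d = np.linalg.norm(queries[:, None, :] - support[None, :, :], axis=-1)
    return float(np.mean((d <= radii[None, :]).any(axis=1)))

def precision_recall(real_emb, gen_emb, k=3):
    precision = coverage(gen_emb, real_emb, knn_radii(real_emb, k))
    recall = coverage(real_emb, gen_emb, knn_radii(gen_emb, k))
    return precision, recall

# Usage (hypothetical embeddings of human-written vs. model-generated text):
# p, r = precision_recall(embed(human_texts), embed(llm_samples))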
Abstract:Rejection sampling methods have recently been proposed to improve the performance of discriminator-based generative models. However, these methods are only optimal under an unlimited sampling budget, and are usually applied to a generator trained independently of the rejection procedure. We first propose an Optimal Budgeted Rejection Sampling (OBRS) scheme that is provably optimal with respect to \textit{any} $f$-divergence between the true distribution and the post-rejection distribution, for a given sampling budget. Second, we propose an end-to-end method that incorporates the sampling scheme into the training procedure to further enhance the model's overall performance. Through experiments and supporting theory, we show that the proposed methods are effective in significantly improving the quality and diversity of the samples.
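As an illustration of the idea, here is a simplified sketch of budgeted rejection sampling: samples from the model are accepted with probability min(1, r(x)/c), where the density ratio r(x) comes from a discriminator and the threshold c is tuned so that the average acceptance rate matches the budget. This is a hedged approximation of the principle, not the exact OBRS rule from the paper.

# Sketch of budgeted rejection sampling (simplified illustration).
import numpy as np

def tune_threshold(ratios, budget, iters=50):
    # Bisection on c so that mean(min(1, r/c)) is roughly 1/budget.
    # Acceptance is decreasing in c, so the bracket [lo, hi] is valid.
    target = 1.0 / budget
    lo, hi = 1e-8, ratios.max() * budget
    for _ in range(iters):
        c = 0.5 * (lo + hi)
        if np.mean(np.minimum(1.0, ratios / c)) > target:
            lo = c          # accepting too often -> raise the threshold
        else:
            hi = c
    return 0.5 * (lo + hi)

def budgeted_rejection_sample(samples, ratios, budget, rng=np.random):
    # samples: array of candidates drawn from the model q
    # ratios:  discriminator-based estimates of p(x)/q(x) for each sample
    c = tune_threshold(ratios, budget)
    accept = rng.uniform(size=len(samples)) < np.minimum(1.0, ratios / c)
    return samples[accept]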
Abstract:Mixtures of classifiers (a.k.a. randomized ensembles) have been proposed as a way to improve robustness against adversarial attacks. However, it has been shown that existing attacks are not well suited for this kind of classifier. In this paper, we discuss the problem of attacking a mixture in a principled way and, based on a geometrical analysis of the problem, introduce two desirable properties of attacks: effectiveness and maximality. We then show that existing attacks do not meet both of these properties. Finally, we introduce a new attack called the lattice climber attack, with theoretical guarantees in the binary linear setting, and demonstrate its performance through experiments on synthetic and real datasets.
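One way to formalize the attack problem (an illustrative convention, not necessarily the paper's exact definitions of effectiveness and maximality): for a mixture $\{(h_k,\pi_k)\}_k$, the adversary should maximize the expected misclassification over the mixture,
\[
\delta^\star \in \arg\max_{\|\delta\|\le \varepsilon}\ \sum_k \pi_k\, \mathbf{1}\{h_k(x+\delta)\neq y\},
\]
a problem that a standard single-model attack (e.g., one run against averaged logits) does not solve in general.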
Abstract:Achieving a balance between image quality (precision) and diversity (recall) is a significant challenge in the domain of generative models. Current state-of-the-art models primarily rely on optimizing heuristics, such as the Fr\'echet Inception Distance. While recent developments have introduced principled methods for evaluating precision and recall, they have yet to be successfully integrated into the training of generative models. Our main contribution is a novel training method for generative models, such as Generative Adversarial Networks and Normalizing Flows, which explicitly optimizes a user-defined trade-off between precision and recall. More precisely, we show that achieving a specified precision-recall trade-off corresponds to minimizing a unique $f$-divergence from a family we call the \mbox{\em PR-divergences}. Conversely, any $f$-divergence can be written as a linear combination of PR-divergences and corresponds to a weighted precision-recall trade-off. Through comprehensive evaluations, we show that our approach improves the performance of existing state-of-the-art models like BigGAN in terms of either precision or recall when tested on datasets such as ImageNet.
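For intuition, in one common parameterization (following Sajjadi et al.), the precision-recall curve between a target $P$ and a model $Q$ with densities $p$ and $q$ can be written as
\[
\alpha(\lambda)=\mathbb{E}_{x\sim Q}\!\left[\min\!\big(\lambda\, p(x)/q(x),\,1\big)\right],\qquad \beta(\lambda)=\alpha(\lambda)/\lambda,\qquad \lambda>0,
\]
and maximizing the precision $\alpha(\lambda)$ at a fixed trade-off level $\lambda$ is, up to additive constants, equivalent to minimizing the $f$-divergence with the piecewise-linear generator $f_\lambda(u)=\max(\lambda u,1)-\max(\lambda,1)$. This sketch uses a common convention and may differ in details from the paper's exact definition of the PR-divergences.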
Abstract:Deep neural networks are known to be vulnerable to adversarial attacks: A small perturbation that is imperceptible to a human can easily make a well-trained deep neural network misclassify. To defend against adversarial attacks, randomized classifiers have been proposed as a robust alternative to deterministic ones. In this work we show that in the binary classification setting, for any randomized classifier, there is always a deterministic classifier with better adversarial risk. In other words, randomization is not necessary for robustness. In many common randomization schemes, the deterministic classifiers with better risk are explicitly described: For example, we show that ensembles of classifiers are more robust than mixtures of classifiers, and randomized smoothing is more robust than input noise injection. Finally, experiments confirm our theoretical results with the two families of randomized classifiers we analyze.
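To fix notation (one standard definition, which may differ in detail from the paper's), the adversarial risk of a randomized classifier that draws $h\sim\rho$ is
\[
\mathcal{R}_\varepsilon(\rho)=\mathbb{E}_{(x,y)}\Big[\sup_{\|\delta\|\le\varepsilon}\ \mathbb{E}_{h\sim\rho}\big[\mathbf{1}\{h(x+\delta)\neq y\}\big]\Big],
\]
and the derandomization claim reads: in binary classification, for every $\rho$ there exists a deterministic classifier $h^\star$ with $\mathcal{R}_\varepsilon(h^\star)\le\mathcal{R}_\varepsilon(\rho)$.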
Abstract:Generative models can exhibit distinct modes of failure, such as mode dropping and low-quality samples, which cannot be captured by a single scalar metric. To address this, recent works propose evaluating generative models using precision and recall, where precision measures the quality of samples and recall measures the coverage of the target distribution. Although a variety of discrepancy measures between the target and estimated distribution are used to train generative models, it is unclear what precision-recall trade-offs are achieved by various choices of the discrepancy measures. In this paper, we show that achieving a specified precision-recall trade-off corresponds to minimising $f$-divergences from a family we call the {\em PR-divergences}. Conversely, any $f$-divergence can be written as a linear combination of PR-divergences and therefore corresponds to minimising a weighted precision-recall trade-off. Further, we propose a novel generative model that is able to train a normalizing flow to minimise any $f$-divergence and, in particular, achieve a given precision-recall trade-off.
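As a small numerical illustration of the trade-off these divergences encode, the following sketch evaluates the discrete precision-recall curve (Sajjadi-style parameterization) between a target and a mode-dropping model; it illustrates the parameterization only, not the paper's training procedure.

# Sketch: precision-recall curve between two discrete distributions,
# to illustrate the trade-off that the PR-divergences optimize.
import numpy as np

def prd_curve(p, q, lambdas):
    # p: target probabilities, q: model probabilities (same support).
    p, q = np.asarray(p, float), np.asarray(q, float)
    alphas = np.array([np.minimum(lam * p, q).sum() for lam in lambdas])  # precision
    betas  = np.array([np.minimum(p, q / lam).sum() for lam in lambdas])  # recall
    return alphas, betas

# Example: a model that nearly drops one mode of the target.
p = np.array([0.5, 0.5])        # target spreads mass over two modes
q = np.array([0.99, 0.01])      # model almost ignores the second mode
alpha, beta = prd_curve(p, q, lambdas=np.logspace(-2, 2, 9))
# High precision is attainable, but high recall comes only at very low precision.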
Abstract:Randomized smoothing is the dominant standard for provable defenses against adversarial examples. Nevertheless, this method has recently been proven to suffer from important information theoretic limitations. In this paper, we argue that these limitations are not intrinsic, but merely a byproduct of current certification methods. We first show that these certificates use too little information about the classifier, and are in particular blind to the local curvature of the decision boundary. This leads to severely sub-optimal robustness guarantees as the dimension of the problem increases. We then show that it is theoretically possible to bypass this issue by collecting more information about the classifier. More precisely, we show that it is possible to approximate the optimal certificate with arbitrary precision, by probing the decision boundary with several noise distributions. Since this process is executed at certification time rather than at test time, it entails no loss in natural accuracy while enhancing the quality of the certificates. This result fosters further research on classifier-specific certification and demonstrates that randomized smoothing is still worth investigating. Although classifier-specific certification may induce more computational cost, we also provide some theoretical insight on how to mitigate it.
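For reference, the classifier-agnostic certificate the abstract refers to is the standard Gaussian-smoothing bound; a minimal sketch (Cohen et al. style, with a hypothetical base_classifier) follows. This is the baseline whose limitations are discussed, not the proposed classifier-specific certification.

# Sketch of the standard randomized-smoothing certificate:
# estimate a lower confidence bound on the top-class probability under
# Gaussian noise and certify an L2 radius of sigma * Phi^{-1}(pA).
import numpy as np
from scipy.stats import norm
from statsmodels.stats.proportion import proportion_confint

def certify(base_classifier, x, sigma, n=1000, alpha=0.001):
    # base_classifier: maps a batch of inputs to integer labels (hypothetical).
    noise = np.random.randn(n, *x.shape) * sigma
    preds = base_classifier(x[None, ...] + noise)
    top = np.bincount(preds).argmax()
    count = int((preds == top).sum())
    p_lower = proportion_confint(count, n, alpha=2 * alpha, method="beta")[0]
    if p_lower <= 0.5:
        return top, 0.0                      # abstain: no certified radius
    return top, sigma * norm.ppf(p_lower)    # certified L2 radius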
Abstract:We propose the first regret-based approach to the Graphical Bilinear Bandits problem, where $n$ agents in a graph play a stochastic bilinear bandit game with each of their neighbors. This setting gives rise to a combinatorial NP-hard problem that prevents the use of any existing regret-based algorithm in the (bi-)linear bandit literature. In this paper, we fill this gap and present the first regret-based algorithm for graphical bilinear bandits using the principle of optimism in the face of uncertainty. Theoretical analysis of this new method yields an upper bound of $\tilde{O}(\sqrt{T})$ on the $\alpha$-regret and highlights the impact of the graph structure on the rate of convergence. Finally, we validate our approach through various experiments.
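For context, a typical formalization of this setting (details may differ from the paper's): at round $t$, each edge $(i,j)$ of the graph yields a noisy bilinear reward
\[
y_t^{(i,j)} = x_{i,t}^\top M^\star\, x_{j,t} + \eta_t,
\]
where $x_{i,t}$ is the arm played by agent $i$ and $M^\star$ is an unknown matrix. Since computing the jointly optimal arm assignment is NP-hard, performance is measured by the $\alpha$-regret
\[
R_\alpha(T)=\sum_{t=1}^{T}\big(\alpha\, r^\star - r(a_t)\big),
\]
the cumulative gap to an $\alpha$-fraction of the optimal expected global reward $r^\star$.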
Abstract:In this paper, we study the problem of consistency in the context of adversarial examples. Specifically, we tackle the following question: can surrogate losses still be used as a proxy for minimizing the $0/1$ loss in the presence of an adversary that alters the inputs at test time? Unlike the standard classification task, this question cannot be reduced to a point-wise minimization problem, and calibration need not be sufficient to ensure consistency. In this paper, we expose some pathological behaviors specific to the adversarial problem and show that no convex surrogate loss can be consistent or calibrated in this context. It is therefore necessary to design another class of surrogate functions that can be used to solve the adversarial consistency issue. As a first step towards designing such a class, we identify necessary and sufficient conditions for a surrogate loss to be calibrated in both the adversarial and standard settings. Finally, we give some directions for building a class of losses that could be consistent in the adversarial framework.
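Concretely, one standard formalization of the question is as follows: with the adversarial $0/1$ risk
\[
\mathcal{R}_\varepsilon(f)=\mathbb{E}_{(x,y)}\Big[\sup_{\|\delta\|\le\varepsilon}\mathbf{1}\{\,y\, f(x+\delta)\le 0\,\}\Big],
\]
a surrogate loss $\ell$ is consistent if every sequence $(f_n)$ that drives the adversarial surrogate risk $\mathbb{E}\big[\sup_{\|\delta\|\le\varepsilon}\ell\big(y f_n(x+\delta)\big)\big]$ to its infimum also drives $\mathcal{R}_\varepsilon(f_n)$ to its infimum; calibration is the corresponding conditional (point-wise) requirement.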