Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Muni Sreenivas Pydi

MILES, LAMSADE

Memorization in Fine-Tuned Large Language Models

Jul 28, 2025

Danil Savine, Muni Sreenivas Pydi, Jamal Atif, Olivier Cappé

Abstract:This study investigates the mechanisms and factors influencing memorization in fine-tuned large language models (LLMs), with a focus on the medical domain due to its privacy-sensitive nature. We examine how different aspects of the fine-tuning process affect a model's propensity to memorize training data, using the PHEE dataset of pharmacovigilance events. Our research employs two main approaches: a membership inference attack to detect memorized data, and a generation task with prompted prefixes to assess verbatim reproduction. We analyze the impact of adapting different weight matrices in the transformer architecture, the relationship between perplexity and memorization, and the effect of increasing the rank in low-rank adaptation (LoRA) fine-tuning. Key findings include: (1) Value and Output matrices contribute more significantly to memorization compared to Query and Key matrices; (2) Lower perplexity in the fine-tuned model correlates with increased memorization; (3) Higher LoRA ranks lead to increased memorization, but with diminishing returns at higher ranks. These results provide insights into the trade-offs between model performance and privacy risks in fine-tuned LLMs. Our findings have implications for developing more effective and responsible strategies for adapting large language models while managing data privacy concerns.

Via

Access Paper or Ask Questions

Unveiling the Role of Randomization in Multiclass Adversarial Classification: Insights from Graph Theory

Mar 18, 2025

Lucas Gnecco-Heredia, Matteo Sammut, Muni Sreenivas Pydi, Rafael Pinot, Benjamin Negrevergne, Yann Chevaleyre

Abstract:Randomization as a mean to improve the adversarial robustness of machine learning models has recently attracted significant attention. Unfortunately, much of the theoretical analysis so far has focused on binary classification, providing only limited insights into the more complex multiclass setting. In this paper, we take a step toward closing this gap by drawing inspiration from the field of graph theory. Our analysis focuses on discrete data distributions, allowing us to cast the adversarial risk minimization problems within the well-established framework of set packing problems. By doing so, we are able to identify three structural conditions on the support of the data distribution that are necessary for randomization to improve robustness. Furthermore, we are able to construct several data distributions where (contrarily to binary classification) switching from a deterministic to a randomized solution significantly reduces the optimal adversarial risk. These findings highlight the crucial role randomization can play in enhancing robustness to adversarial attacks in multiclass classification.

* 9 pages (main), 30 in total. Camera-ready version, accepted at AISTATS 2025

Via

Access Paper or Ask Questions

Memorization in Attention-only Transformers

Nov 15, 2024

Léo Dana, Muni Sreenivas Pydi, Yann Chevaleyre

Figure 1 for Memorization in Attention-only Transformers

Figure 2 for Memorization in Attention-only Transformers

Figure 3 for Memorization in Attention-only Transformers

Figure 4 for Memorization in Attention-only Transformers

Abstract:Recent research has explored the memorization capacity of multi-head attention, but these findings are constrained by unrealistic limitations on the context size. We present a novel proof for language-based Transformers that extends the current hypothesis to any context size. Our approach improves upon the state-of-the-art by achieving more effective exact memorization with an attention layer, while also introducing the concept of approximate memorization of distributions. Through experimental validation, we demonstrate that our proposed bounds more accurately reflect the true memorization capacity of language models, and provide a precise comparison with prior work.

* 16 pages, 6 figures, submitted to AISTATS 2025,

Via

Access Paper or Ask Questions

Optimal Classification under Performative Distribution Shift

Nov 04, 2024

Edwige Cyffers, Muni Sreenivas Pydi, Jamal Atif, Olivier Cappé

Figure 1 for Optimal Classification under Performative Distribution Shift

Figure 2 for Optimal Classification under Performative Distribution Shift

Figure 3 for Optimal Classification under Performative Distribution Shift

Figure 4 for Optimal Classification under Performative Distribution Shift

Abstract:Performative learning addresses the increasingly pervasive situations in which algorithmic decisions may induce changes in the data distribution as a consequence of their public deployment. We propose a novel view in which these performative effects are modelled as push-forward measures. This general framework encompasses existing models and enables novel performative gradient estimation methods, leading to more efficient and scalable learning strategies. For distribution shifts, unlike previous models which require full specification of the data distribution, we only assume knowledge of the shift operator that represents the performative changes. This approach can also be integrated into various change-of-variablebased models, such as VAEs or normalizing flows. Focusing on classification with a linear-in-parameters performative effect, we prove the convexity of the performative risk under a new set of assumptions. Notably, we do not limit the strength of performative effects but rather their direction, requiring only that classification becomes harder when deploying more accurate models. In this case, we also establish a connection with adversarially robust classification by reformulating the minimization of the performative risk as a min-max variational problem. Finally, we illustrate our approach on synthetic and real datasets.

* 38th Conference on Neural Information Processing Systems, Dec 2024, Vancouver (Canada), Canada

Via

Access Paper or Ask Questions

Optimal Budgeted Rejection Sampling for Generative Models

Nov 01, 2023

Alexandre Verine, Muni Sreenivas Pydi, Benjamin Negrevergne, Yann Chevaleyre

Figure 1 for Optimal Budgeted Rejection Sampling for Generative Models

Figure 2 for Optimal Budgeted Rejection Sampling for Generative Models

Figure 3 for Optimal Budgeted Rejection Sampling for Generative Models

Figure 4 for Optimal Budgeted Rejection Sampling for Generative Models

Abstract:Rejection sampling methods have recently been proposed to improve the performance of discriminator-based generative models. However, these methods are only optimal under an unlimited sampling budget, and are usually applied to a generator trained independently of the rejection procedure. We first propose an Optimal Budgeted Rejection Sampling (OBRS) scheme that is provably optimal with respect to \textit{any} $f$-divergence between the true distribution and the post-rejection distribution, for a given sampling budget. Second, we propose an end-to-end method that incorporates the sampling scheme into the training procedure to further enhance the model's overall performance. Through experiments and supporting theory, we show that the proposed methods are effective in significantly improving the quality and diversity of the samples.

Via

Access Paper or Ask Questions

Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows

May 30, 2023

Alexandre Verine, Benjamin Negrevergne, Muni Sreenivas Pydi, Yann Chevaleyre

Figure 1 for Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows

Figure 2 for Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows

Figure 3 for Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows

Figure 4 for Precision-Recall Divergence Optimization for Generative Modeling with GANs and Normalizing Flows

Abstract:Achieving a balance between image quality (precision) and diversity (recall) is a significant challenge in the domain of generative models. Current state-of-the-art models primarily rely on optimizing heuristics, such as the Fr\'echet Inception Distance. While recent developments have introduced principled methods for evaluating precision and recall, they have yet to be successfully integrated into the training of generative models. Our main contribution is a novel training method for generative models, such as Generative Adversarial Networks and Normalizing Flows, which explicitly optimizes a user-defined trade-off between precision and recall. More precisely, we show that achieving a specified precision-recall trade-off corresponds to minimizing a unique $f$-divergence from a family we call the \mbox{\em PR-divergences}. Conversely, any $f$-divergence can be written as a linear combination of PR-divergences and corresponds to a weighted precision-recall trade-off. Through comprehensive evaluations, we show that our approach improves the performance of existing state-of-the-art models like BigGAN in terms of either precision or recall when tested on datasets such as ImageNet.

Via

Access Paper or Ask Questions

Training Normalizing Flows with the Precision-Recall Divergence

Feb 02, 2023

Alexandre Verine, Benjamin Negrevergne, Muni Sreenivas Pydi, Yann Chevaleyre

Figure 1 for Training Normalizing Flows with the Precision-Recall Divergence

Figure 2 for Training Normalizing Flows with the Precision-Recall Divergence

Figure 3 for Training Normalizing Flows with the Precision-Recall Divergence

Figure 4 for Training Normalizing Flows with the Precision-Recall Divergence

Abstract:Generative models can have distinct mode of failures like mode dropping and low quality samples, which cannot be captured by a single scalar metric. To address this, recent works propose evaluating generative models using precision and recall, where precision measures quality of samples and recall measures the coverage of the target distribution. Although a variety of discrepancy measures between the target and estimated distribution are used to train generative models, it is unclear what precision-recall trade-offs are achieved by various choices of the discrepancy measures. In this paper, we show that achieving a specified precision-recall trade-off corresponds to minimising -divergences from a family we call the {\em PR-divergences }. Conversely, any -divergence can be written as a linear combination of PR-divergences and therefore correspond to minimising a weighted precision-recall trade-off. Further, we propose a novel generative model that is able to train a normalizing flow to minimise any -divergence, and in particular, achieve a given precision-recall trade-off.

Via

Access Paper or Ask Questions

Robust empirical risk minimization via Newton's method

Jan 30, 2023

Eirini Ioannou, Muni Sreenivas Pydi, Po-Ling Loh

Figure 1 for Robust empirical risk minimization via Newton's method

Figure 2 for Robust empirical risk minimization via Newton's method

Figure 3 for Robust empirical risk minimization via Newton's method

Figure 4 for Robust empirical risk minimization via Newton's method

Abstract:We study a variant of Newton's method for empirical risk minimization, where at each iteration of the optimization algorithm, we replace the gradient and Hessian of the objective function by robust estimators taken from existing literature on robust mean estimation for multivariate data. After proving a general theorem about the convergence of successive iterates to a small ball around the population-level minimizer, we study consequences of our theory in generalized linear models, when data are generated from Huber's epsilon-contamination model and/or heavy-tailed distributions. We also propose an algorithm for obtaining robust Newton directions based on the conjugate gradient method, which may be more appropriate for high-dimensional settings, and provide conjectures about the convergence of the resulting algorithm. Compared to the robust gradient descent algorithm proposed by Prasad et al. (2020), our algorithm enjoys the faster rates of convergence for successive iterates often achieved by second-order algorithms for convex problems, i.e., quadratic convergence in a neighborhood of the optimum, with a stepsize that may be chosen adaptively via backtracking linesearch.

Via

Access Paper or Ask Questions

The Many Faces of Adversarial Risk

Jan 22, 2022

Muni Sreenivas Pydi, Varun Jog

Figure 1 for The Many Faces of Adversarial Risk

Figure 2 for The Many Faces of Adversarial Risk

Figure 3 for The Many Faces of Adversarial Risk

Abstract:Adversarial risk quantifies the performance of classifiers on adversarially perturbed data. Numerous definitions of adversarial risk -- not all mathematically rigorous and differing subtly in the details -- have appeared in the literature. In this paper, we revisit these definitions, make them rigorous, and critically examine their similarities and differences. Our technical tools derive from optimal transport, robust statistics, functional analysis, and game theory. Our contributions include the following: generalizing Strassen's theorem to the unbalanced optimal transport setting with applications to adversarial classification with unequal priors; showing an equivalence between adversarial robustness and robust hypothesis testing with $\infty$-Wasserstein uncertainty sets; proving the existence of a pure Nash equilibrium in the two-player game between the adversary and the algorithm; and characterizing adversarial risk by the minimum Bayes error between a pair of distributions belonging to the $\infty$-Wasserstein uncertainty sets. Our results generalize and deepen recently discovered connections between optimal transport and adversarial robustness and reveal new connections to Choquet capacities and game theory.

* A version of this paper was presented at NeurIPS 2021

Via

Access Paper or Ask Questions

Adversarial Risk via Optimal Transport and Optimal Couplings

Dec 05, 2019

Muni Sreenivas Pydi, Varun Jog

Figure 1 for Adversarial Risk via Optimal Transport and Optimal Couplings

Figure 2 for Adversarial Risk via Optimal Transport and Optimal Couplings

Figure 3 for Adversarial Risk via Optimal Transport and Optimal Couplings

Figure 4 for Adversarial Risk via Optimal Transport and Optimal Couplings

Abstract:The accuracy of modern machine learning algorithms deteriorates severely on adversarially manipulated test data. Optimal adversarial risk quantifies the best error rate of any classifier in the presence of adversaries, and optimal adversarial classifiers are sought that minimize adversarial risk. In this paper, we investigate the optimal adversarial risk and optimal adversarial classifiers from an optimal transport perspective. We present a new and simple approach to show that the optimal adversarial risk for binary classification with $0-1$ loss function is completely characterized by an optimal transport cost between the probability distributions of the two classes, for a suitably defined cost function. We propose a novel coupling strategy that achieves the optimal transport cost for several univariate distributions like Gaussian, uniform and triangular. Using the optimal couplings, we obtain the optimal adversarial classifiers in these settings and show how they differ from optimal classifiers in the absence of adversaries. Based on our analysis, we evaluate algorithm-independent fundamental limits on adversarial risk for CIFAR-10, MNIST, Fashion-MNIST and SVHN datasets, and Gaussian mixtures based on them. In addition to the $0-1$ loss, we also derive bounds on the deviation of optimal risk and optimal classifier in the presence of adversaries for continuous loss functions, that are based on the convexity and smoothness of the loss functions.

Via

Access Paper or Ask Questions