Title: Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Jun 20, 2024

Lynn Chua, Badih Ghazi, Yangsibo Huang, Pritish Kamath, Daogao Liu, Pasin Manurangsi, Amer Sinha, Chiyuan Zhang

Abstract: Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy (DP) offers a promising solution by ensuring models are "almost indistinguishable" with or without any particular privacy unit, current evaluations on LLMs mostly treat each example (text record) as the privacy unit. This leads to uneven user privacy guarantees when contributions per user vary. We therefore study user-level DP, motivated by applications where it is necessary to ensure uniform privacy protection across users. We present a systematic evaluation of user-level DP for LLM fine-tuning on natural language generation tasks. Focusing on two mechanisms for achieving user-level DP guarantees, Group Privacy and User-wise DP-SGD, we investigate design choices like data selection strategies and parameter tuning for the best privacy-utility tradeoff.
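
To make the "uneven user privacy guarantees" point concrete, here is a minimal Python sketch of the textbook group-privacy bound: a mechanism that is (eps, delta)-DP with respect to one record is (k*eps, delta * (1 + e^eps + ... + e^((k-1)*eps)))-DP with respect to a group of k records. This is only an illustration under stated assumptions. The function name `group_privacy`, the user names, and the parameter values are hypothetical, and the paper's own Group Privacy mechanism may rely on a different (possibly tighter) accounting.

```python
import math


def group_privacy(eps: float, delta: float, k: int) -> tuple[float, float]:
    """Convert a record-level (eps, delta)-DP guarantee into a guarantee
    for a group of k records, using the standard group-privacy bound.

    Illustrative helper only; not taken from the paper.
    """
    if k < 1:
        raise ValueError("k must be at least 1")
    group_eps = k * eps
    # delta grows by the geometric sum 1 + e^eps + ... + e^((k-1)*eps).
    group_delta = delta * sum(math.exp(i * eps) for i in range(k))
    # A delta above 1 carries no information, so cap it at 1.
    return group_eps, min(1.0, group_delta)


# Hypothetical users contributing different numbers of records.
records_per_user = {"alice": 1, "bob": 5, "carol": 20}

# Assumed record-level guarantee of the fine-tuning run.
record_eps, record_delta = 0.5, 1e-8

for user, k in records_per_user.items():
    user_eps, user_delta = group_privacy(record_eps, record_delta, k)
    print(f"{user}: {k} records -> eps={user_eps:.2f}, delta={user_delta:.3g}")
```

Under this bound, a user with more records ends up with a weaker effective guarantee than a user with a single record. This is the unevenness that motivates fixing the privacy unit at the user level, for example by capping each user's contribution to k records before applying the conversion.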
