Title: DP-MLM: Differentially Private Text Rewriting Using Masked Language Models

Jun 30, 2024

Stephen Meisenbacher, Maulik Chevli, Juraj Vladika, Florian Matthes

Abstract: The task of text privatization using Differential Privacy has recently taken the form of $\textit{text rewriting}$, in which an input text is obfuscated via the use of generative (large) language models. While these methods have shown promising results in preserving privacy, they rely on autoregressive models, which lack a mechanism to contextualize the private rewriting process. In response, we propose $\textbf{DP-MLM}$, a new method for differentially private text rewriting that leverages masked language models (MLMs) to rewrite text in a semantically similar $\textit{and}$ obfuscated manner. We accomplish this with a simple contextualization technique, whereby we rewrite a text one token at a time. We find that utilizing encoder-only MLMs provides better utility preservation at lower $\varepsilon$ levels than previous methods that rely on larger models with a decoder. In addition, MLMs allow for greater customization of the rewriting mechanism than generative approaches do. We make the code for $\textbf{DP-MLM}$ public and reusable at https://github.com/sjmeis/DPMLM.
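
To make the "one token at a time" idea concrete, here is a minimal sketch of what a token-level private rewrite could look like. It is not the authors' implementation (that lives in the repository linked above). It assumes the replacement for each position is drawn with the exponential mechanism over clipped candidate scores, which amounts to temperature sampling with temperature 2Δ/ε, where Δ is the width of the clipping range. The names `VOCAB`, `toy_scores`, `dp_sample`, and `rewrite` are hypothetical, and `toy_scores` is a hand-written stand-in for a real masked language model so that the example runs with the standard library alone.

```python
import math
import random

# Hypothetical toy vocabulary; a real MLM would score its full vocabulary.
VOCAB = ["the", "a", "cat", "dog", "sat", "slept", "mat", "rug"]


def toy_scores(tokens, i):
    """Stand-in for an MLM: score every candidate for position i.

    Assumption for illustration only: the original token gets a high
    score and every other candidate a low one.
    """
    return {w: (5.0 if w == tokens[i] else 1.0) for w in VOCAB}


def dp_sample(scores, epsilon, lo, hi, rng):
    """Draw one candidate with the exponential mechanism.

    Scores are clipped to [lo, hi], so their sensitivity is hi - lo.
    Sampling proportionally to exp(eps * score / (2 * sensitivity)) is
    the same as softmax sampling with temperature 2 * sensitivity / eps.
    """
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    temperature = 2.0 * (hi - lo) / epsilon
    scaled = {w: max(lo, min(hi, s)) / temperature for w, s in scores.items()}
    top = max(scaled.values())  # subtract the max for numerical stability
    words = list(scaled)
    weights = [math.exp(scaled[w] - top) for w in words]
    return rng.choices(words, weights=weights, k=1)[0]


def rewrite(tokens, epsilon, score_fn=toy_scores, lo=0.0, hi=5.0, seed=0):
    """Rewrite a token list one position at a time.

    Each position is scored in the context of the input text and then
    replaced by a privately sampled candidate.
    """
    rng = random.Random(seed)
    return [
        dp_sample(score_fn(tokens, i), epsilon, lo, hi, rng)
        for i in range(len(tokens))
    ]


if __name__ == "__main__":
    text = ["the", "cat", "sat"]
    print(rewrite(text, epsilon=1e6))  # very large epsilon: almost no noise
    print(rewrite(text, epsilon=0.1))  # small epsilon: close to uniform
```

In this sketch ε is a per-token budget: under basic sequential composition, rewriting n tokens spends n·ε in total. Swapping `toy_scores` for the mask-position logits of an actual MLM would be the natural next step; how the context is presented to that model is left open here.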

* 15 pages, 2 figures, 8 tables. Accepted to ACL 2024 (Findings)
