Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Title:Understanding and Mitigating Copying in Diffusion Models

May 31, 2023

Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, Tom Goldstein

Figure 1 for Understanding and Mitigating Copying in Diffusion Models

Figure 2 for Understanding and Mitigating Copying in Diffusion Models

Figure 3 for Understanding and Mitigating Copying in Diffusion Models

Figure 4 for Understanding and Mitigating Copying in Diffusion Models

Share this with someone who'll enjoy it:

Abstract:Images generated by diffusion models like Stable Diffusion are increasingly widespread. Recent works and even lawsuits have shown that these models are prone to replicating their training data, unbeknownst to the user. In this paper, we first analyze this memorization problem in text-to-image diffusion models. While it is widely believed that duplicated images in the training set are responsible for content replication at inference time, we observe that the text conditioning of the model plays a similarly important role. In fact, we see in our experiments that data replication often does not happen for unconditional models, while it is common in the text-conditional case. Motivated by our findings, we then propose several techniques for reducing data replication at both training and inference time by randomizing and augmenting image captions in the training set.

* 17 pages, preprint. Code is available at https://github.com/somepago/DCR

View paper on

Share this with someone who'll enjoy it:

Title:Understanding and Mitigating Copying in Diffusion Models

Paper and Code