Understanding and Mitigating Copying in Diffusion Models

About

Images generated by diffusion models like Stable Diffusion are increasingly widespread. Recent works and even lawsuits have shown that these models are prone to replicating their training data, unbeknownst to the user. In this paper, we first analyze this memorization problem in text-to-image diffusion models. While it is widely believed that duplicated images in the training set are responsible for content replication at inference time, we observe that the text conditioning of the model plays a similarly important role. In fact, we see in our experiments that data replication often does not happen for unconditional models, while it is common in the text-conditional case. Motivated by our findings, we then propose several techniques for reducing data replication at both training and inference time by randomizing and augmenting image captions in the training set.

Gowthami Somepalli, Vasu Singla, Micah Goldblum, Jonas Geiping, Tom Goldstein• 2023

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	Pokémon	CLIP Score32.22	21
Text-to-Image Generation	CelebA-HQ	SSCD0.81	18
Text-to-Image Generation	LAION-Art	SSCD0.76	18
Memorization mitigation	Stable Diffusion 1.4	Memorization Rate30	13
Text-conditional image generation	LAION-10k	CLIP Score0.302	10
Mitigating memorization in conditional diffusion models	Scenario 3 duplicated prompts Stable Diffusion v1.4	Similarity (95pc)0.748	8
Text-to-Image Generation	Scenario 4	Similarity (95th Percentile)0.9049	8
Text-to-Image Generation	Memorized Prompts	SSCD0.58	7
Memorization mitigation	SD 1.4 (val)	Inference Time (s)1.753	7
Text-to-Image Generation	SD template memorization 1.4	SSCD0.617	7

Showing 10 of 17 rows

Other info

Code

Follow for update

@wizwand_team Discord