Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes

About

In this paper, we introduce a geometric framework to analyze memorization in diffusion models through the sharpness of the log probability density. We mathematically justify a previously proposed score-difference-based memorization metric by demonstrating its effectiveness in quantifying sharpness. Additionally, we propose a novel memorization metric that captures sharpness at the initial stage of image generation in latent diffusion models, offering early insights into potential memorization. Leveraging this metric, we develop a mitigation strategy that optimizes the initial noise of the generation process using a sharpness-aware regularization term. The code is publicly available at https://github.com/Dongjae0324/sharpness_memorization_diffusion.

Dongjae Jeon, Dueun Kim, Albert No• 2024

Related benchmarks

Task	Dataset	Result
Memorization Detection	Stable Diffusion V1.4	AUC0.998	28
Memorization Detection	SD 2.1	AUC0.9909	24
Memorization Detection	SD LAION Lexica COCO-2017 GPT-4 1000 prompts 1.4 (Evaluation)	AUC0.9994	16
Memorization mitigation	Stable Diffusion 1.4	Memorization Rate34.8	13
Memorization Detection	SD First 3 Steps 1.4	AUC99.94	9
Memorization Detection	Stable Diffusion 1.5	--	9
Memorization Detection	Stable Diffusion v2.0	AUC0.991	8
Memorization mitigation	Stable Diffusion finetuned 1.4	Memorization Rate0.00e+0	7
Memorization mitigation	SD 1.4 (val)	Inference Time (s)50.469	7

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord