Understanding and Mitigating Memorization in Generative Models via Sharpness of Probability Landscapes
About
In this paper, we introduce a geometric framework to analyze memorization in diffusion models through the sharpness of the log probability density. We mathematically justify a previously proposed score-difference-based memorization metric by demonstrating its effectiveness in quantifying sharpness. Additionally, we propose a novel memorization metric that captures sharpness at the initial stage of image generation in latent diffusion models, offering early insights into potential memorization. Leveraging this metric, we develop a mitigation strategy that optimizes the initial noise of the generation process using a sharpness-aware regularization term. The code is publicly available at https://github.com/Dongjae0324/sharpness_memorization_diffusion.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Memorization Detection | Stable Diffusion V1.4 | AUC0.998 | 28 | |
| Memorization Detection | SD 2.1 | AUC0.9909 | 24 | |
| Memorization Detection | SD LAION Lexica COCO-2017 GPT-4 1000 prompts 1.4 (Evaluation) | AUC0.9994 | 16 | |
| Memorization mitigation | Stable Diffusion 1.4 | Memorization Rate34.8 | 13 | |
| Memorization Detection | SD First 3 Steps 1.4 | AUC99.94 | 9 | |
| Memorization Detection | Stable Diffusion 1.5 | -- | 9 | |
| Memorization Detection | Stable Diffusion v2.0 | AUC0.991 | 8 | |
| Memorization mitigation | Stable Diffusion finetuned 1.4 | Memorization Rate0.00e+0 | 7 | |
| Memorization mitigation | SD 1.4 (val) | Inference Time (s)50.469 | 7 |