Broken Memories: Detecting and Mitigating Memorization in Diffusion Models with Degraded Generations
About
While diffusion models excel at generating high-quality images, their tendency to memorize training data poses significant privacy and copyright risks. In this work, we for the first time identify that memorization induces internal numerical instability, often manifesting as visually ``broken'' artifacts. Inspired by stability analysis in numerical methods, we introduce empirical stability regions based on latent update norms to quantitatively characterize stable behavior during generation. Leveraging this, we propose a principled, on-the-fly framework for step-wise detection and adaptive mitigation. Our approach suppresses memorization without altering prompts or guidance, thereby preserving semantic fidelity and image quality. Extensive experiments on Stable Diffusion 1.4 demonstrate that our method achieves an AUC $>0.999$ detection performance and a $0.0\%$ memorization rate after mitigation with negligible overhead ($\approx0.01$s per image).
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Memorization Detection | Stable Diffusion V1.4 | AUC0.9995 | 28 | |
| Memorization Detection | SD 2.1 | AUC0.9962 | 24 | |
| Memorization Detection | SD LAION Lexica COCO-2017 GPT-4 1000 prompts 1.4 (Evaluation) | AUC0.9997 | 16 | |
| Memorization mitigation | Stable Diffusion 1.4 | Memorization Rate0.00e+0 | 13 | |
| Memorization Detection | Stable Diffusion 1.5 | AUC (All Steps)99.97 | 9 | |
| Memorization Detection | SD First 3 Steps 1.4 | AUC99.82 | 9 | |
| Memorization mitigation | Stable Diffusion finetuned 1.4 | Memorization Rate1.5 | 7 | |
| Memorization mitigation | SD 1.4 (val) | Inference Time (s)1.757 | 7 | |
| Memorization Detection | SD Avg. on Steps 1.4 | AUC99.92 | 6 | |
| Diffusion Model Memorization Detection and Mitigation | LAION 400M | AUC0.999 | 1 |