Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Broken Memories: Detecting and Mitigating Memorization in Diffusion Models with Degraded Generations

About

While diffusion models excel at generating high-quality images, their tendency to memorize training data poses significant privacy and copyright risks. In this work, we for the first time identify that memorization induces internal numerical instability, often manifesting as visually ``broken'' artifacts. Inspired by stability analysis in numerical methods, we introduce empirical stability regions based on latent update norms to quantitatively characterize stable behavior during generation. Leveraging this, we propose a principled, on-the-fly framework for step-wise detection and adaptive mitigation. Our approach suppresses memorization without altering prompts or guidance, thereby preserving semantic fidelity and image quality. Extensive experiments on Stable Diffusion 1.4 demonstrate that our method achieves an AUC $>0.999$ detection performance and a $0.0\%$ memorization rate after mitigation with negligible overhead ($\approx0.01$s per image).

Yuanmin Huang, Mi Zhang, Chen Chen, Feifei Li, Geng Hong, Xiaoyu You, Min Yang• 2026

Related benchmarks

TaskDatasetResultRank
Memorization DetectionStable Diffusion V1.4
AUC0.9995
28
Memorization DetectionSD 2.1
AUC0.9962
24
Memorization DetectionSD LAION Lexica COCO-2017 GPT-4 1000 prompts 1.4 (Evaluation)
AUC0.9997
16
Memorization mitigationStable Diffusion 1.4
Memorization Rate0.00e+0
13
Memorization DetectionStable Diffusion 1.5
AUC (All Steps)99.97
9
Memorization DetectionSD First 3 Steps 1.4
AUC99.82
9
Memorization mitigationStable Diffusion finetuned 1.4
Memorization Rate1.5
7
Memorization mitigationSD 1.4 (val)
Inference Time (s)1.757
7
Memorization DetectionSD Avg. on Steps 1.4
AUC99.92
6
Diffusion Model Memorization Detection and MitigationLAION 400M
AUC0.999
1
Showing 10 of 10 rows

Other info

Follow for update