Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mitigating Diffusion Model Hallucinations with Dynamic Guidance

About

Diffusion models, despite their impressive demos, often produce hallucinatory samples with structural inconsistencies that lie outside of the support of the true data distribution. Such hallucinations can be attributed to excessive smoothing between modes of the data distribution. However, semantic interpolations are often desirable and can lead to generation diversity, thus we believe a more nuanced solution is required. In this work, we introduce Dynamic Guidance, which tackles this issue. Dynamic Guidance mitigates hallucinations by selectively sharpening the score function only along the pre-determined directions known to cause artifacts, while preserving valid semantic variations. To our knowledge, this is the first approach that addresses hallucinations at generation time rather than through post-hoc filtering. Dynamic Guidance substantially reduces hallucinations on both controlled and natural image datasets, significantly outperforming baselines.

Kostas Triaridis, Alexandros Graikos, Aggelina Chatziagapi, Grigorios G. Chrysos, Dimitris Samaras• 2025

Related benchmarks

TaskDatasetResultRank
Image Generation11kHands
FID16.2
6
Image GenerationMNIST
FID32.1
6
Image GenerationSimpleShapes
FID27.8
6
Image GenerationFFHQ
FID13.8
6
Low-Dose CT ReconstructionRSNA 512-slice (val)
FID35.6
6
Synthetic Data GenerationGaussianGrid
MMD0.08
5
Showing 6 of 6 rows

Other info

Follow for update