Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps

About

Diffusion models have achieved remarkable success in image synthesis. However, addressing artifacts and unrealistic regions remains a critical challenge. We propose self-refining diffusion, a novel framework that enhances image generation quality by detecting these flaws. The framework employs an explainable artificial intelligence (XAI)-based flaw highlighter to produce flaw activation maps (FAMs) that identify artifacts and unrealistic regions. These FAMs improve reconstruction quality by amplifying noise in flawed regions during the forward process and by focusing on these regions during the reverse process. The proposed approach achieves up to a 27.3% improvement in Fr\'echet inception distance across various diffusion-based models, demonstrating consistently strong performance on diverse datasets. It also shows robust effectiveness across different tasks, including image generation, text-to-image generation, and inpainting. These results demonstrate that explainable AI techniques can extend beyond interpretability to actively contribute to image refinement. The proposed framework offers a versatile and effective approach applicable to various diffusion models and tasks, significantly advancing the field of image synthesis.

Seoyeon Lee, Gwangyeol Yu, Chaewon Kim, Jonghyuk Park• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationMS-COCO
FID24.633
75
Image GenerationCelebA-HQ--
23
Image GenerationOxford Flower 102 128x128 (test)
FID19.669
2
Image GenerationLSUN Church 64x64 (test)
FID9.093
2
Image InpaintingCelebA-HQ Wide masks (test)
FID10.3564
2
Image InpaintingCelebA-HQ Narrow masks (test)
FID5.5396
2
Image InpaintingCelebA-HQ Alternating Lines masks (test)
FID1.2211
2
Showing 7 of 7 rows

Other info

Follow for update