Refining Visual Artifacts in Diffusion Models via Explainable AI-based Flaw Activation Maps

About

Diffusion models have achieved remarkable success in image synthesis. However, addressing artifacts and unrealistic regions remains a critical challenge. We propose self-refining diffusion, a novel framework that enhances image generation quality by detecting these flaws. The framework employs an explainable artificial intelligence (XAI)-based flaw highlighter to produce flaw activation maps (FAMs) that identify artifacts and unrealistic regions. These FAMs improve reconstruction quality by amplifying noise in flawed regions during the forward process and by focusing on these regions during the reverse process. The proposed approach achieves up to a 27.3% improvement in Fréchet inception distance across various diffusion-based models, demonstrating consistently strong performance on diverse datasets. It also shows robust effectiveness across different tasks, including image generation, text-to-image generation, and inpainting. These results demonstrate that explainable AI techniques can extend beyond interpretability to actively contribute to image refinement. The proposed framework offers a versatile and effective approach applicable to various diffusion models and tasks, significantly advancing the field of image synthesis.
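The abstract does not give the exact formulation, so the following is only a rough sketch of what "amplifying noise in flawed regions during the forward process" could look like. It assumes the standard DDPM forward step and a FAM normalized to [0, 1]. The function name `fam_weighted_forward`, the `strength` parameter, and the linear weighting `1 + strength * fam` are illustrative assumptions, not the authors' method.

```python
import numpy as np

def fam_weighted_forward(x0, fam, alpha_bar_t, strength=0.5, rng=None):
    """Noise x0 to one timestep, scaling the noise up where fam is high.

    x0:          image array, shape (H, W) or (H, W, C)
    fam:         flaw activation map in [0, 1], shape (H, W)
    alpha_bar_t: cumulative signal-retention coefficient in (0, 1)
    strength:    hypothetical knob for how much extra noise flawed pixels get
    """
    rng = np.random.default_rng() if rng is None else rng
    eps = rng.standard_normal(x0.shape)
    # Broadcast the map over the channel axis if the image has one.
    weight = 1.0 + strength * (fam[..., None] if x0.ndim == 3 else fam)
    # Standard DDPM forward step, with the noise term scaled per pixel.
    # The per-pixel scaling is an assumption made for illustration.
    x_t = np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * weight * eps
    return x_t, weight
```

With an all-zero FAM the weight is 1 everywhere and this reduces to the usual forward step. Pixels flagged by the map receive proportionally more noise, so a subsequent reverse pass has more freedom to regenerate them.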

Seoyeon Lee, Gwangyeol Yu, Chaewon Kim, Jonghyuk Park • 2025

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	MS-COCO	FID 24.633	75
Image Generation	CelebA-HQ	--	23
Image Generation	Oxford Flower 102 128x128 (test)	FID 19.669	2
Image Generation	LSUN Church 64x64 (test)	FID 9.093	2
Image Inpainting	CelebA-HQ Wide masks (test)	FID 10.3564	2
Image Inpainting	CelebA-HQ Narrow masks (test)	FID 5.5396	2
Image Inpainting	CelebA-HQ Alternating Lines masks (test)	FID 1.2211	2
