Aligned Stable Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency

About

Generative image inpainting can produce realistic results even with large, irregular masks, but existing methods still suffer from two common problems: (1) Unwanted object insertion: hallucinate artifacts that do not match the surrounding context. (2) Color inconsistency: noticeable color shifts that lead to smeared textures. We analyze the causes of these issues and propose Aligned Stable inpainting with UnKnown Areas prior (ASUKA), a post-hoc framework for pre-trained inpainting models. To reduce unwanted object insertion, we use reconstruction-based priors to guide the generative model, suppressing hallucinated objects while preserving generative flexibility. To address color inconsistency, we design a specialized VAE decoder that formulates latent-to-image decoding as a local harmonization task. We implement ASUKA on both U-Net-based and DiT-based inpainting models with lightweight modifications. Experiments on Places2 and MISATO, our proposed benchmark, show that ASUKA effectively suppresses object hallucination and improves color consistency, outperforming existing diffusion- and rectified flow-based inpainting methods. The dataset, models, and code will be released on GitHub.

Yikai Wang, Junqiu Yu, Chenjie Cao, Xiangyang Xue, Yanwei Fu• 2026

Related benchmarks

Task	Dataset	Result
Image Inpainting	FFHQ (test)	LPIPS0.128	97
Image Inpainting	CelebA-HQ (test)	LPIPS0.126	18
Image Inpainting	MISATO @512 (test)	LPIPS0.139	17
Inpainting	Places 2 (val)	LPIPS0.174	15
Image Inpainting	User Study 40 random images (test)	UOM32.88	12
Image Inpainting	MISATO User Study 1.0 (test)	UOM39.43	9
Object Hallucination Evaluation	MISATO 512 resolution	VLM Judgment136	5
Image Inpainting	MISATO@1K (test)	LPIPS0.156	4

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord