Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance

About

Recently, diffusion models have emerged as promising newcomers in the field of generative models, shining brightly in image generation. However, when employed for object removal tasks, they still encounter issues such as generating random artifacts and the incapacity to repaint foreground object areas with appropriate content after removal. To tackle these problems, we propose Attentive Eraser, a tuning-free method to empower pre-trained diffusion models for stable and effective object removal. Firstly, in light of the observation that the self-attention maps influence the structure and shape details of the generated images, we propose Attention Activation and Suppression (ASS), which re-engineers the self-attention mechanism within the pre-trained diffusion models based on the given mask, thereby prioritizing the background over the foreground object during the reverse generation process. Moreover, we introduce Self-Attention Redirection Guidance (SARG), which utilizes the self-attention redirected by ASS to guide the generation process, effectively removing foreground objects within the mask while simultaneously generating content that is both plausible and coherent. Experiments demonstrate the stability and effectiveness of Attentive Eraser in object removal across a variety of pre-trained diffusion models, outperforming even training-based methods. Furthermore, Attentive Eraser can be implemented in various diffusion model architectures and checkpoints, enabling excellent scalability. Code is available at https://github.com/Anonym0u3/AttentiveEraser.

Wenhao Sun, Benlei Cui, Xue-Mei Dong, Jingqun Tang• 2024

Related benchmarks

TaskDatasetResultRank
Rain RemovalRain 0.5
PSNR (dB)36.0376
20
Lip-readingGRID (test)
WER2.9
18
Object RemovalRemovalBench
Latency (s)25
15
Object RemovalRemovalBench paired
SSIM0.78
11
Object RemovalOpenImages V7 2020 (test)
BG Similarity55.9
11
Object RemovalRORD 2022 (test)
BG Similarity54.1
11
Video Object RemovalDAVIS
TokSim30.82
10
Rain RemovalRain
PSNR (dB)31.4529
10
Snow RemovalCityScape + Snow100K
PSNR (dB)27.77
10
Video Object RemovalWIPER-Bench
TokSim25.28
9
Showing 10 of 12 rows

Other info

Follow for update