
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting

About

Advancing image inpainting is challenging as it requires filling user-specified regions for various intents, such as background filling and object synthesis. Existing approaches focus on either context-aware filling or object synthesis using text descriptions. However, achieving both tasks simultaneously is challenging due to differing training strategies. To overcome this challenge, we introduce PowerPaint, the first high-quality and versatile inpainting model that excels in multiple inpainting tasks. First, we introduce learnable task prompts along with tailored fine-tuning strategies to guide the model's focus on different inpainting targets explicitly. This enables PowerPaint to accomplish various inpainting tasks by utilizing different task prompts, resulting in state-of-the-art performance. Second, we demonstrate the versatility of the task prompt in PowerPaint by showcasing its effectiveness as a negative prompt for object removal. Moreover, we leverage prompt interpolation techniques to enable controllable shape-guided object inpainting, enhancing the model's applicability in shape-guided applications. Finally, we conduct extensive experiments and applications to verify the effectiveness of PowerPaint. We release our code and models on our project page: https://powerpaint.github.io/.

Junhao Zhuang, Yanhong Zeng, Wenran Liu, Chun Yuan, Kai Chen • 2023
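
The abstract mentions two prompt-level mechanisms: using a task prompt as a negative prompt, and interpolating between prompts. The sketch below illustrates the generic arithmetic usually behind these ideas, not PowerPaint's actual implementation. It assumes prompts are plain embedding vectors, that interpolation is a linear blend, and that the negative prompt enters through the standard classifier-free guidance formula. The function names and the toy vectors are hypothetical and do not come from the paper or its code release.

```python
from typing import List, Sequence

Vector = List[float]


def interpolate_prompts(a: Sequence[float], b: Sequence[float], alpha: float) -> Vector:
    """Linearly blend two prompt embeddings (assumed form of interpolation).

    alpha = 0 returns a, alpha = 1 returns b.
    """
    if len(a) != len(b):
        raise ValueError("embeddings must have the same length")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [(1.0 - alpha) * x + alpha * y for x, y in zip(a, b)]


def guided_noise(eps_pos: Sequence[float], eps_neg: Sequence[float], scale: float) -> Vector:
    """Standard classifier-free guidance with a negative prompt.

    eps_pos: noise predicted under the positive prompt.
    eps_neg: noise predicted under the negative prompt.
    Result:  eps_neg + scale * (eps_pos - eps_neg), which pushes the
             sample away from whatever the negative prompt describes.
    """
    if len(eps_pos) != len(eps_neg):
        raise ValueError("noise predictions must have the same length")
    return [n + scale * (p - n) for p, n in zip(eps_pos, eps_neg)]


# Toy 2-D stand-ins for two learned task-prompt embeddings (hypothetical values).
context_prompt = [1.0, 0.0]
object_prompt = [0.0, 1.0]

blended = interpolate_prompts(context_prompt, object_prompt, 0.25)
guided = guided_noise([1.0, 2.0], [0.5, 1.0], 2.0)

print(blended)  # [0.75, 0.25]
print(guided)   # [1.5, 3.0]
```

In a real diffusion pipeline the embeddings would be high-dimensional tensors and the noise predictions would come from the denoising network; the blend weight and guidance scale here are arbitrary illustration values.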

Related benchmarks

| Task | Dataset | Result | Rank |
|---|---|---|---|
| Object Removal | OBER (test) | PSNR 26.2 | 20 |
| Object Removal | OBER-Wild | ReMOVE† Score 80.44 | 20 |
| Object Removal | RORD (val) | PSNR 21.46 | 20 |
| Object Removal | RemovalBench | Latency (s) 2 | 15 |
| Text-guided image inpainting | MSCOCO with layout masks (test) | ImageReward 0.2593 | 15 |
| Image Inpainting | EditBench free-form masks (val) | ImageReward 0.0842 | 15 |
| Object Removal | RemovalBench paired | SSIM 0.751 | 11 |
| Object Removal | OpenImages V7 2020 (test) | BG Similarity 66.9 | 11 |
| Object Removal | RORD 2022 (test) | BG Similarity 72.9 | 11 |
| Object Removal | MULAN | PSNR 21.18 | 11 |
Showing 10 of 26 rows
