Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Inst-Inpaint: Instructing to Remove Objects with Diffusion Models

About

Image inpainting task refers to erasing unwanted pixels from images and filling them in a semantically consistent and realistic way. Traditionally, the pixels that are wished to be erased are defined with binary masks. From the application point of view, a user needs to generate the masks for the objects they would like to remove which can be time-consuming and prone to errors. In this work, we are interested in an image inpainting algorithm that estimates which object to be removed based on natural language input and removes it, simultaneously. For this purpose, first, we construct a dataset named GQA-Inpaint for this task. Second, we present a novel inpainting framework, Inst-Inpaint, that can remove objects from images based on the instructions given as text prompts. We set various GAN and diffusion-based baselines and run experiments on synthetic and real image datasets. We compare methods with different evaluation metrics that measure the quality and accuracy of the models and show significant quantitative and qualitative improvements.

Ahmet Burak Yildirim, Vedat Baday, Erkut Erdem, Aykut Erdem, Aysegul Dundar• 2023

Related benchmarks

TaskDatasetResultRank
Instruction-guided image editingMagicBrush single-turn (test)
CLIP Similarity (Image)0.887
13
Interactive video inpaintingROVI (test)
PSNR18.96
9
Instruction-guided image editingMagicBrush multi-turn (test)
CLIP-T0.277
7
Instruction-guided image editingZONE (test)
CLIP-T0.267
7
Image InpaintingFSS-1000
FID7.32
6
Erase InpaintingOpenImages v5 (test)
FID11.423
6
Object EliminationOpenImages v5 (test)
PIDS (%)0.00e+0
6
Object RemovalGQA-Inpaint
PSNR22.533
5
referring video inpaintingROVI (test)
PSNR19
5
Object RemovalObject Removal
LPIPS0.227
4
Showing 10 of 10 rows

Other info

Follow for update