Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Amodal Completion via Progressive Mixed Context Diffusion

About

Our brain can effortlessly recognize objects even when partially hidden from view. Seeing the visible of the hidden is called amodal completion; however, this task remains a challenge for generative AI despite rapid progress. We propose to sidestep many of the difficulties of existing approaches, which typically involve a two-step process of predicting amodal masks and then generating pixels. Our method involves thinking outside the box, literally! We go outside the object bounding box to use its context to guide a pre-trained diffusion inpainting model, and then progressively grow the occluded object and trim the extra background. We overcome two technical challenges: 1) how to be free of unwanted co-occurrence bias, which tends to regenerate similar occluders, and 2) how to judge if an amodal completion has succeeded. Our amodal completion method exhibits improved photorealistic completion results compared to existing approaches in numerous successful completion cases. And the best part? It doesn't require any special training or fine-tuning of models.

Katherine Xu, Lingzhi Zhang, Jianbo Shi• 2023

Related benchmarks

TaskDatasetResultRank
Amodal CompletionHiFi-Amodal dataset
CLIP-I95.29
5
Occluded Object RecognitionOccluded COCO
Top-1 Accuracy44.74
5
Occluded Object RecognitionCOCO Separated
Top-1 Acc34.5
5
Amodal CompletionPix2Gestalt ground-truth benchmark (test)
GT-LPIPS0.225
5
Amodal CompletionFree Images
CLIP Score (Image)96.005
4
Amodal CompletionLAION
CLIP Image Score94.687
4
Amodal SegmentationCOCO-A (Easy)
mIoU86.9
4
Amodal SegmentationBSDS-A (Hard)
mIoU57.37
4
Amodal CompletionVG
Human Preference Rate0.1662
4
Amodal CompletionCOCO-A
Human Preference Rate15.38
4
Showing 10 of 25 rows

Other info

Follow for update