Coarse-to-Fine Amodal Segmentation with Shape Prior

About

Amodal object segmentation is a challenging task that involves segmenting both visible and occluded parts of an object. In this paper, we propose a novel approach, called Coarse-to-Fine Segmentation (C2F-Seg), that addresses this problem by progressively modeling the amodal segmentation. C2F-Seg initially reduces the learning space from the pixel-level image space to the vector-quantized latent space. This enables us to better handle long-range dependencies and learn a coarse-grained amodal segment from visual features and visible segments. However, this latent space lacks detailed information about the object, which makes it difficult to provide a precise segmentation directly. To address this issue, we propose a convolution refine module to inject fine-grained information and provide a more precise amodal object segmentation based on visual features and coarse-predicted segmentation. To help the studies of amodal object segmentation, we create a synthetic amodal dataset, named as MOViD-Amodal (MOViD-A), which can be used for both image and video amodal object segmentation. We extensively evaluate our model on two benchmark datasets: KINS and COCO-A. Our empirical results demonstrate the superiority of C2F-Seg. Moreover, we exhibit the potential of our approach for video amodal object segmentation tasks on FISHBOWL and our proposed MOViD-A. Project page at: http://jianxgao.github.io/C2F-Seg.

Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu• 2023

Related benchmarks

Task	Dataset	Result
Amodal Instance Segmentation	KINS (test)	Amodal AP82.22	21
Amodal Segmentation	COCOA (test)	mIoU (Full)80.28	13
Video Amodal Segmentation	MOVi-D	mIoU71.67	12
Amodal Segmentation	KINS	mIoU (Full)87.89	10
Amodal Instance Segmentation	COCOA cls	mIoU87.1	7
Amodal Segmentation	COCOA	mIoU (Full)80.28	6

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord