Coarse-to-Fine Amodal Segmentation with Shape Prior

About

Amodal object segmentation is a challenging task that involves segmenting both the visible and the occluded parts of an object. In this paper, we propose a novel approach, called Coarse-to-Fine Segmentation (C2F-Seg), that addresses this problem by modeling the amodal segmentation progressively. C2F-Seg first reduces the learning space from the pixel-level image space to a vector-quantized latent space. This enables us to better handle long-range dependencies and to learn a coarse-grained amodal segment from visual features and visible segments. However, this latent space lacks detailed information about the object, which makes it difficult to produce a precise segmentation directly. To address this issue, we propose a convolutional refinement module that injects fine-grained information and produces a more precise amodal object segmentation from the visual features and the coarse predicted segmentation. To support research on amodal object segmentation, we create a synthetic amodal dataset, named MOViD-Amodal (MOViD-A), which can be used for both image and video amodal object segmentation. We extensively evaluate our model on two benchmark datasets: KINS and COCO-A. Our empirical results demonstrate the superiority of C2F-Seg. Moreover, we show the potential of our approach for video amodal object segmentation on FISHBOWL and our proposed MOViD-A. Project page: http://jianxgao.github.io/C2F-Seg.
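
To make the "vector-quantized latent space" step more concrete, here is a minimal sketch of nearest-neighbour vector quantization in NumPy. It is not the authors' implementation: the function name `quantize`, the toy codebook, and the feature values are all hypothetical, and a real model would learn the codebook rather than hard-code it.

```python
import numpy as np

def quantize(features, codebook):
    """Map each feature vector to its nearest codebook entry.

    features: (N, D) array of continuous vectors.
    codebook: (K, D) array of code vectors (hypothetical, normally learned).
    Returns (indices, quantized) with shapes (N,) and (N, D).
    """
    # Squared Euclidean distance between every feature and every code: (N, K).
    diffs = features[:, None, :] - codebook[None, :, :]
    dists = (diffs ** 2).sum(axis=-1)
    # Index of the closest code for each feature.
    indices = dists.argmin(axis=1)
    return indices, codebook[indices]

# Toy data: three 2-D codes and three 2-D features (illustrative values only).
codebook = np.array([[0.0, 0.0], [1.0, 1.0], [4.0, 4.0]])
features = np.array([[0.1, -0.2], [0.9, 1.2], [3.5, 4.1]])
indices, quantized = quantize(features, codebook)
```

After quantization each position is described by a single discrete index, which is why a model operating in this space works with a far smaller learning space than raw pixels, at the cost of the fine detail that the refinement stage is meant to restore.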

Jianxiong Gao, Xuelin Qian, Yikai Wang, Tianjun Xiao, Tong He, Zheng Zhang, Yanwei Fu • 2023

Related benchmarks

Task                          Dataset       Result             Rank
Amodal Instance Segmentation  KINS (test)   Amodal AP 82.22    21
Amodal Segmentation           COCOA (test)  mIoU (Full) 80.28  13
Video Amodal Segmentation     MOVi-D        mIoU 71.67         12
Amodal Segmentation           KINS          mIoU (Full) 87.89  10
Amodal Instance Segmentation  COCOA cls     mIoU 87.1          7
Amodal Segmentation           COCOA         mIoU (Full) 80.28  6
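
For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of how a mean intersection-over-union score can be computed for binary masks. It assumes that "Full" refers to IoU over the complete amodal mask (visible plus occluded pixels) and that the mean is taken over objects; the exact protocol behind each benchmark row may differ, and the helper names `mask_iou` and `mean_iou` are hypothetical.

```python
import numpy as np

def mask_iou(pred, gt):
    """Intersection over union of two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Both masks empty: treat as a perfect match (a convention, assumed here).
        return 1.0
    return float(np.logical_and(pred, gt).sum() / union)

def mean_iou(preds, gts):
    """Average per-object IoU over paired predictions and ground truths."""
    return float(np.mean([mask_iou(p, g) for p, g in zip(preds, gts)]))

# Toy 4x4 masks: the ground truth covers 8 pixels, the prediction 6 of them.
gt = np.zeros((4, 4), dtype=bool)
gt[1:3, 0:4] = True
pred = np.zeros((4, 4), dtype=bool)
pred[1:3, 0:3] = True

score = mask_iou(pred, gt)                # 6 / 8 = 0.75
average = mean_iou([pred, gt], [gt, gt])  # (0.75 + 1.0) / 2 = 0.875
```

Scores computed this way lie in [0, 1]; the table reports them as percentages.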