Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation
About
Semantic amodal segmentation is a recently proposed extension to instance-aware segmentation that includes the prediction of the invisible region of each object instance. We present the first all-in-one end-to-end trainable model for semantic amodal segmentation that predicts the amodal instance masks as well as their visible and invisible part in a single forward pass. In a detailed analysis, we provide experiments to show which architecture choices are beneficial for an all-in-one amodal segmentation model. On the COCO amodal dataset, our model outperforms the current baseline for amodal segmentation by a large margin. To further evaluate our model, we provide two new datasets with ground truth for semantic amodal segmentation, D2S amodal and COCOA cls. For both datasets, our model provides a strong baseline performance. Using special data augmentation techniques, we show that amodal segmentation on D2S amodal is possible with reasonable performance, even without providing amodal training data.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Instance Segmentation | COCO 2017 (test-dev) | AP (Overall)37 | 253 | |
| Amodal Instance Segmentation | KINS (test) | Amodal AP29 | 16 | |
| Amodal Instance Segmentation | KINS | AP (Detection)26.97 | 10 | |
| Object Detection | KINS (test) | APdet30.9 | 9 | |
| Amodal Panoptic Segmentation | KITTI-360 APS (val) | APQ41.1 | 7 | |
| Amodal Panoptic Segmentation | BDD100K-APS (val) | APQ44.9 | 7 | |
| Instance Segmentation | COCO-OCC | AP29.67 | 5 | |
| Amodal Instance Segmentation | COCOA (test) | AP (All)21.51 | 4 | |
| Amodal Instance Segmentation | COCOA | APall21.51 | 4 |