One-Shot Instance Segmentation
About
We tackle the problem of one-shot instance segmentation: Given an example image of a novel, previously unknown object category, find and segment all objects of this category within a complex scene. To address this challenging new task, we propose Siamese Mask R-CNN. It extends Mask R-CNN by a Siamese backbone encoding both reference image and scene, allowing it to target detection and segmentation towards the reference category. We demonstrate empirical results on MS Coco highlighting challenges of the one-shot setting: while transferring knowledge about instance segmentation to novel object categories works very well, targeting the detection network towards the reference category appears to be more difficult. Our work provides a first strong baseline for one-shot instance segmentation and will hopefully inspire further research into more powerful and flexible scene analysis algorithms. Code is available at: https://github.com/bethgelab/siamese-mask-rcnn
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 1-way segmentation | COCO-20i | mIoU (Fold 0)17.5 | 18 | |
| Object Detection | COCO-20i (test) | AP (COCO-20^0)20 | 8 | |
| Object Counting | COCO 2017 (val Fold 2) | MAE2.64 | 6 | |
| Object Counting | COCO Fold 0 2017 (val) | MAE2.91 | 6 | |
| Object Counting | COCO 2017 (val Fold 1) | MAE2.47 | 6 | |
| Object Counting | COCO 2017 (val Fold 3) | MAE2.82 | 6 | |
| Object Counting | COCO Average 2017 (val) | MAE2.71 | 6 | |
| Object Detection | COCO 2017 (Split-1) | Base AP5038.9 | 6 | |
| Object Detection | COCO 2017 (Split-2) | bAP5037.1 | 6 | |
| Object Detection | COCO 2017 (Split-3) | Base AP5037.8 | 6 |