Panoptic Segmentation with a Joint Semantic and Instance Segmentation Network
About
We present a single network method for panoptic segmentation. This method combines the predictions from a jointly trained semantic and instance segmentation network using heuristics. Joint training is the first step towards an end-to-end panoptic segmentation network and is faster and more memory efficient than training and predicting with two networks, as done in previous work. The architecture consists of a ResNet-50 feature extractor shared by the semantic segmentation and instance segmentation branch. For instance segmentation, a Mask R-CNN type of architecture is used, while the semantic segmentation branch is augmented with a Pyramid Pooling Module. Results for this method are submitted to the COCO and Mapillary Joint Recognition Challenge 2018. Our approach achieves a PQ score of 17.6 on the Mapillary Vistas validation set and 27.2 on the COCO test-dev set.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Panoptic Segmentation | COCO (val) | PQ26.9 | 219 | |
| Panoptic Segmentation | COCO 2017 (val) | PQ26.9 | 172 | |
| Panoptic Segmentation | COCO (test-dev) | PQ27.2 | 162 | |
| Panoptic Segmentation | Mapillary Vistas (val) | PQ17.6 | 82 | |
| Panoptic Segmentation | COCO 2017 (test-dev) | PQ27.2 | 41 | |
| Panoptic Segmentation | MS-COCO (val) | PQ26.9 | 24 | |
| Panoptic Segmentation | MS-COCO 2018 (test-dev) | PQ (Panoptic Quality)27.2 | 15 | |
| Panoptic Segmentation | COCO 2017 2018 (val) | PQ26.9 | 6 |