Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Seamless Scene Segmentation

About

In this work we introduce a novel, CNN-based architecture that can be trained end-to-end to deliver seamless scene segmentation results. Our goal is to predict consistent semantic segmentation and detection results by means of a panoptic output format, going beyond the simple combination of independently trained segmentation and detection models. The proposed architecture takes advantage of a novel segmentation head that seamlessly integrates multi-scale features generated by a Feature Pyramid Network with contextual information conveyed by a light-weight DeepLab-like module. As additional contribution we review the panoptic metric and propose an alternative that overcomes its limitations when evaluating non-instance categories. Our proposed network architecture yields state-of-the-art results on three challenging street-level datasets, i.e. Cityscapes, Indian Driving Dataset and Mapillary Vistas.

Lorenzo Porzi, Samuel Rota Bul\`o, Aleksander Colovic, Peter Kontschieder• 2019

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (val)
mIoU80.7
572
Panoptic SegmentationCityscapes (val)
PQ65
276
Instance SegmentationCityscapes (val)
AP33.6
239
Panoptic SegmentationMapillary Vistas (val)
PQ37.7
82
Semantic segmentationMapillary Vistas (val)
mIoU50.4
72
Panoptic SegmentationCityscapes (test)
PQ62.6
51
Semantic segmentationWildDash bench (test)
mIoU Meta Avg (cla)37.9
19
Instance SegmentationMapillary Vistas Dataset (val)
AP16.4
19
Semantic segmentationWildDash 2 (val)
mIoU37.1
9
Hierarchical Semantic SegmentationMapillary Vistas 2.0 (val)
mIoU (Level 1)38.17
9
Showing 10 of 12 rows

Other info

Follow for update