Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

K-Net: Towards Unified Image Segmentation

About

Semantic, instance, and panoptic segmentations have been addressed using different and specialized frameworks despite their underlying connections. This paper presents a unified, simple, and effective framework for these essentially similar tasks. The framework, named K-Net, segments both instances and semantic categories consistently by a group of learnable kernels, where each kernel is responsible for generating a mask for either a potential instance or a stuff class. To remedy the difficulties of distinguishing various instances, we propose a kernel update strategy that enables each kernel dynamic and conditional on its meaningful group in the input image. K-Net can be trained in an end-to-end manner with bipartite matching, and its training and inference are naturally NMS-free and box-free. Without bells and whistles, K-Net surpasses all previous published state-of-the-art single-model results of panoptic segmentation on MS COCO test-dev split and semantic segmentation on ADE20K val split with 55.2% PQ and 54.3% mIoU, respectively. Its instance segmentation performance is also on par with Cascade Mask R-CNN on MS COCO with 60%-90% faster inference speeds. Code and models will be released at https://github.com/ZwwWayne/K-Net/.

Wenwei Zhang, Jiangmiao Pang, Kai Chen, Chen Change Loy• 2021

Related benchmarks

TaskDatasetResultRank
Semantic segmentationADE20K (val)
mIoU54.3
2731
Instance SegmentationCOCO 2017 (val)
APm0.42
1144
Instance SegmentationCOCO (test-dev)
APM43.3
380
Panoptic SegmentationCityscapes (val)
PQ61.2
276
Instance SegmentationCOCO 2017 (test-dev)
AP (Overall)40.1
253
Panoptic SegmentationCOCO (val)
PQ54.6
219
Panoptic SegmentationCOCO 2017 (val)
PQ54.6
172
Panoptic SegmentationCOCO (test-dev)
PQ55.2
162
Semantic segmentationBDD100K (val)
mIoU67.59
72
Panoptic SegmentationCOCO 2017 (test-dev)
PQ55.2
41
Showing 10 of 17 rows

Other info

Code

Follow for update