Pixel Contrastive-Consistent Semi-Supervised Semantic Segmentation
About
We present a novel semi-supervised semantic segmentation method which jointly achieves two desiderata of segmentation model regularities: the label-space consistency property between image augmentations and the feature-space contrastive property among different pixels. We leverage the pixel-level L2 loss and the pixel contrastive loss for the two purposes respectively. To address the computational efficiency issue and the false negative noise issue involved in the pixel contrastive loss, we further introduce and investigate several negative sampling techniques. Extensive experiments demonstrate the state-of-the-art performance of our method (PC2Seg) with the DeepLab-v3+ architecture, in several challenging semi-supervised settings derived from the VOC, Cityscapes, and COCO datasets.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (val) | Mean IoU74.2 | 2040 | |
| Semantic segmentation | COCO 2014 (val) | mIoU46.1 | 251 | |
| Semantic segmentation | PASCAL VOC classic 2012 (val) | mIoU74.2 | 143 | |
| Semantic segmentation | COCO (val) | mIoU46.1 | 135 | |
| Semantic segmentation | PASCAL VOC 2012 (val) | -- | 126 | |
| Semantic segmentation | COCO | mIoU46.1 | 96 | |
| Semantic segmentation | Pascal VOC Original protocol 92 labeled images | mIoU56.9 | 48 | |
| Semantic segmentation | PASCAL VOC 2012 original (val) | mIoU74.15 | 46 | |
| Semantic segmentation | Cityscapes fine (val) | mIoU75.99 | 42 | |
| Semantic segmentation | Pascal VOC Original protocol 1464 labeled images | mIoU72.3 | 36 |