Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Learning Pixel-level Semantic Affinity with Image-level Supervision for Weakly Supervised Semantic Segmentation

About

The deficiency of segmentation labels is one of the main obstacles to semantic segmentation in the wild. To alleviate this issue, we present a novel framework that generates segmentation labels of images given their image-level class labels. In this weakly supervised setting, trained models have been known to segment local discriminative parts rather than the entire object area. Our solution is to propagate such local responses to nearby areas which belong to the same semantic entity. To this end, we propose a Deep Neural Network (DNN) called AffinityNet that predicts semantic affinity between a pair of adjacent image coordinates. The semantic propagation is then realized by random walk with the affinities predicted by AffinityNet. More importantly, the supervision employed to train AffinityNet is given by the initial discriminative part segmentation, which is incomplete as a segmentation annotation but sufficient for learning semantic affinities within small image areas. Thus the entire framework relies only on image-level class labels and does not require any extra data or annotations. On the PASCAL VOC 2012 dataset, a DNN learned with segmentation labels generated by our method outperforms previous models trained with the same level of supervision, and is even as competitive as those relying on stronger supervision.

Jiwoon Ahn, Suha Kwak• 2018

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU64.8
2040
Semantic segmentationPASCAL VOC 2012 (test)
mIoU65.5
1342
Semantic segmentationCamVid (test)
mIoU15.5
411
Semantic segmentationPASCAL VOC (val)
mIoU61.7
338
Semantic segmentationCityscapes (val)
mIoU8.2
287
Semantic segmentationPascal VOC (test)
mIoU63.7
236
Weakly supervised semantic segmentationPASCAL VOC 2012 (test)
mIoU63.7
158
Weakly supervised semantic segmentationPASCAL VOC 2012 (val)
mIoU61.7
154
Semantic segmentationCOCO (val)
mIoU29.5
135
Semantic segmentationPASCAL VOC 2012 (train)
mIoU59.7
73
Showing 10 of 14 rows

Other info

Follow for update