Saliency Guided Self-attention Network for Weakly and Semi-supervised Semantic Segmentation

About

Weakly supervised semantic segmentation (WSSS) using only image-level labels can greatly reduce the annotation cost and therefore has attracted considerable research interest. However, its performance is still inferior to the fully supervised counterparts. To mitigate the performance gap, we propose a saliency guided self-attention network (SGAN) to address the WSSS problem. The introduced self-attention mechanism is able to capture rich and extensive contextual information but may mis-spread attentions to unexpected regions. In order to enable this mechanism to work effectively under weak supervision, we integrate class-agnostic saliency priors into the self-attention mechanism and utilize class-specific attention cues as an additional supervision for SGAN. Our SGAN is able to produce dense and accurate localization cues so that the segmentation performance is boosted. Moreover, by simply replacing the additional supervisions with partially labeled ground-truth, SGAN works effectively for semi-supervised semantic segmentation as well. Experiments on the PASCAL VOC 2012 and COCO datasets show that our approach outperforms all other state-of-the-art methods in both weakly and semi-supervised settings.

Qi Yao, Xiaojin Gong • 2019

Related benchmarks

Task                                     Dataset                            Result                               Rank
Semantic segmentation                    PASCAL VOC 2012 (val)              Mean IoU 68.3                        2040
Semantic segmentation                    PASCAL VOC 2012 (test)             mIoU 68.5                            1342
Semantic segmentation                    COCO 2014 (val)                    mIoU 33.6                            251
Weakly supervised semantic segmentation  PASCAL VOC 2012 (test)             mIoU 67.2                            158
Weakly supervised semantic segmentation  PASCAL VOC 2012 (val)              mIoU 67.1                            154
Semantic segmentation                    PASCAL VOC 2012 (train)            mIoU 62.8                            73
Weakly supervised semantic segmentation  MS-COCO 2014 (val)                 mIoU 33.6                            27
Weakly supervised semantic segmentation  VOC 2012 (val)                     mIoU 67.1                            19
Semantic segmentation                    Pascal VOC Image Tags 2012 (test)  mIoU 67.2                            6
Semantic segmentation                    PASCAL-CONTEXT (trainval)          Confusion Ratio (boat vs water) 0.1  5
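Most results above are reported as mean intersection-over-union (mIoU). As a reminder of what that metric measures, here is a minimal sketch of the standard definition: per-class IoU = TP / (TP + FP + FN), averaged over the classes that occur. The function name mean_iou, the flat label lists, and the ignore_index value of 255 (the usual PASCAL VOC "void" label) are assumptions made for illustration. This page does not state the exact evaluation protocol behind the listed numbers, so this is not necessarily the script that produced them.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over flat sequences of integer class labels.

    pred and target are equal-length sequences of per-pixel labels.
    Pixels whose target equals ignore_index are skipped (assumed
    convention, not taken from this page). Predictions are assumed
    to lie in the range [0, num_classes).
    """
    inter = [0] * num_classes  # true positives per class
    union = [0] * num_classes  # TP + FP + FN per class
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue
        if p == t:
            inter[t] += 1
            union[t] += 1
        else:
            union[p] += 1  # false positive for the predicted class
            union[t] += 1  # false negative for the true class
    # Average only over classes that appear in pred or target.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else float("nan")


# Toy example: four pixels, the last one is a void pixel.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 255], num_classes=3)
print(f"mIoU = {100 * score:.1f}")  # prints "mIoU = 50.0"
```

The table values appear to be on a 0 to 100 scale, which is why the example multiplies by 100. Benchmark evaluations usually accumulate the per-class counts over the whole dataset before dividing, rather than averaging per-image scores; the function above does the same if all pixels of all images are passed in together.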