Self-supervised Equivariant Attention Mechanism for Weakly Supervised Semantic Segmentation

About

Image-level weakly supervised semantic segmentation is a challenging problem that has been studied in depth in recent years. Most advanced solutions exploit class activation maps (CAMs). However, CAMs can hardly serve as object masks because of the gap between full and weak supervision. In this paper, we propose a self-supervised equivariant attention mechanism (SEAM) to discover additional supervision and narrow the gap. Our method is based on the observation that equivariance is an implicit constraint in fully supervised semantic segmentation, whose pixel-level labels undergo the same spatial transformation as the input images during data augmentation. However, this constraint is lost on CAMs trained with image-level supervision. Therefore, we propose consistency regularization on the CAMs predicted from differently transformed images to provide self-supervision for network learning. Moreover, we propose a pixel correlation module (PCM), which exploits contextual appearance information and refines the prediction of the current pixel using its similar neighbors, leading to further improvement in CAM consistency. Extensive experiments on the PASCAL VOC 2012 dataset demonstrate that our method outperforms state-of-the-art methods using the same level of supervision. The code is released online.
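
The two ideas in the abstract can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the toy networks, the horizontal flip standing in for the spatial transformation, the mean-absolute-difference loss, and the clipped cosine-similarity affinity used for the PCM-style refinement are all assumptions made for illustration, and every function name is hypothetical.

```python
import math

def hflip(grid):
    """Horizontal flip, standing in for the spatial transformation A (assumed)."""
    return [list(reversed(row)) for row in grid]

def equivariance_loss(network, image, transform=hflip):
    """Mean absolute difference between A(CAM(x)) and CAM(A(x))."""
    cam_then_transform = transform(network(image))
    transform_then_cam = network(transform(image))
    total, count = 0.0, 0
    for row_a, row_b in zip(cam_then_transform, transform_then_cam):
        for a, b in zip(row_a, row_b):
            total += abs(a - b)
            count += 1
    return total / count

def pointwise_net(image):
    """Equivariant toy network: each output depends only on its own pixel."""
    return [[2.0 * v for v in row] for row in image]

def position_biased_net(image):
    """Non-equivariant toy network: the output also depends on the column index."""
    return [[v * (col + 1) for col, v in enumerate(row)] for row in image]

def cosine(u, v):
    """Cosine similarity of two feature vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0

def refine_by_similar_pixels(features, cam):
    """PCM-style refinement (assumed form): each pixel's score becomes the
    affinity-weighted average of all scores, with negative affinities clipped to 0."""
    refined = []
    for i, f_i in enumerate(features):
        weights = [max(0.0, cosine(f_i, f_j)) for f_j in features]
        total = sum(weights)
        if total > 0:
            refined.append(sum(w * c for w, c in zip(weights, cam)) / total)
        else:
            refined.append(cam[i])  # no usable neighbors: keep the original score
    return refined

image = [[1.0, 2.0], [3.0, 4.0]]
loss_equivariant = equivariance_loss(pointwise_net, image)   # 0.0
loss_biased = equivariance_loss(position_biased_net, image)  # 2.5

# Three flattened pixels: the first two look alike, the third is different.
features = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
cam = [1.0, 0.0, 0.2]
refined = refine_by_similar_pixels(features, cam)            # [0.5, 0.5, 0.2]
```

In this sketch the pointwise network incurs zero loss because flipping before or after prediction gives the same map, while the position-dependent network is penalized. The refinement step pulls the scores of the two similar pixels toward their shared average and leaves the dissimilar pixel unchanged.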

Yude Wang, Jie Zhang, Meina Kan, Shiguang Shan, Xilin Chen • 2020

Related benchmarks

Task                                     Dataset                 Result          Rank
Semantic segmentation                    PASCAL VOC 2012 (val)   Mean IoU 64.5   2040
Semantic segmentation                    PASCAL VOC 2012 (test)  mIoU 65.7       1342
Change Detection                         LEVIR-CD (test)         F1 Score 41.38  357
Semantic segmentation                    PASCAL VOC (val)        mIoU 64.5       338
Semantic segmentation                    Cityscapes (val)        mIoU 17.3       287
Change Detection                         WHU-CD (test)           IoU 33.42       286
Semantic segmentation                    COCO 2014 (val)         mIoU 32.8       251
Semantic segmentation                    Pascal VOC (test)       mIoU 65.7       236
Weakly supervised semantic segmentation  PASCAL VOC 2012 (test)  mIoU 65.7       158
Weakly supervised semantic segmentation  PASCAL VOC 2012 (val)   mIoU 64.5       154
Showing 10 of 41 rows
