Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Expansion and Shrinkage of Localization for Weakly-Supervised Semantic Segmentation

About

Generating precise class-aware pseudo ground-truths, a.k.a, class activation maps (CAMs), is essential for weakly-supervised semantic segmentation. The original CAM method usually produces incomplete and inaccurate localization maps. To tackle with this issue, this paper proposes an Expansion and Shrinkage scheme based on the offset learning in the deformable convolution, to sequentially improve the recall and precision of the located object in the two respective stages. In the Expansion stage, an offset learning branch in a deformable convolution layer, referred as "expansion sampler" seeks for sampling increasingly less discriminative object regions, driven by an inverse supervision signal that maximizes image-level classification loss. The located more complete object in the Expansion stage is then gradually narrowed down to the final object region during the Shrinkage stage. In the Shrinkage stage, the offset learning branch of another deformable convolution layer, referred as "shrinkage sampler", is introduced to exclude the false positive background regions attended in the Expansion stage to improve the precision of the localization maps. We conduct various experiments on PASCAL VOC 2012 and MS COCO 2014 to well demonstrate the superiority of our method over other state-of-the-art methods for weakly-supervised semantic segmentation. Code will be made publicly available here https://github.com/TyroneLi/ESOL_WSSS.

Jinlong Li, Zequn Jie, Xu Wang, Xiaolin Wei, Lin Ma• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU71.1
2040
Semantic segmentationPASCAL VOC 2012 (test)
mIoU70.4
1342
Semantic segmentationCOCO 2014 (val)
mIoU42.6
251
Semantic segmentationCOCO (val)
mIoU42.6
135
Weakly supervised semantic segmentationPASCAL VOC 2012 (train)
mIoU (Mask)68.7
53
Pseudo Ground-Truth GenerationPASCAL VOC 2012 (train)--
19
Weakly supervised semantic segmentationMS COCO 2014 (train)--
4
Showing 7 of 7 rows

Other info

Code

Follow for update