Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Self-supervised Image-specific Prototype Exploration for Weakly Supervised Semantic Segmentation

About

Weakly Supervised Semantic Segmentation (WSSS) based on image-level labels has attracted much attention due to low annotation costs. Existing methods often rely on Class Activation Mapping (CAM) that measures the correlation between image pixels and classifier weight. However, the classifier focuses only on the discriminative regions while ignoring other useful information in each image, resulting in incomplete localization maps. To address this issue, we propose a Self-supervised Image-specific Prototype Exploration (SIPE) that consists of an Image-specific Prototype Exploration (IPE) and a General-Specific Consistency (GSC) loss. Specifically, IPE tailors prototypes for every image to capture complete regions, formed our Image-Specific CAM (IS-CAM), which is realized by two sequential steps. In addition, GSC is proposed to construct the consistency of general CAM and our specific IS-CAM, which further optimizes the feature representation and empowers a self-correction ability of prototype exploration. Extensive experiments are conducted on PASCAL VOC 2012 and MS COCO 2014 segmentation benchmark and results show our SIPE achieves new state-of-the-art performance using only image-level labels. The code is available at https://github.com/chenqi1126/SIPE.

Qi Chen, Lingxiao Yang, Jianhuang Lai, Xiaohua Xie• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationPASCAL VOC 2012 (val)
Mean IoU69.5
2142
Semantic segmentationPASCAL VOC 2012 (test)
mIoU69.7
1415
Semantic segmentationCOCO 2014 (val)
mIoU44.7
304
Weakly supervised semantic segmentationPASCAL VOC 2012 (val)
mIoU68.8
168
Weakly supervised semantic segmentationPASCAL VOC 2012 (test)
mIoU69.7
158
Semantic segmentationCOCO (val)
mIoU43.6
150
Semantic segmentationPASCAL VOC 2012 (val)
mIoU58.6
126
Semantic segmentationCOCO 2017 (val)
mIoU43.6
66
Pseudo Ground-Truth GenerationPASCAL VOC 2012 (train)
mIoU64.4
19
Weakly Supervised Class LocalizationPascal VOC (train)
mIoU58.6
15
Showing 10 of 11 rows

Other info

Code

Follow for update