
DuPL: Dual Student with Trustworthy Progressive Learning for Robust Weakly Supervised Semantic Segmentation

About

Recently, one-stage Weakly Supervised Semantic Segmentation (WSSS) with image-level labels has gained increasing interest because it is simpler than its cumbersome multi-stage counterpart. Limited by the inherent ambiguity of Class Activation Maps (CAMs), we observe that one-stage pipelines often encounter confirmation bias caused by incorrect CAM pseudo-labels, impairing their final segmentation performance. Although recent works discard many unreliable pseudo-labels to implicitly alleviate this issue, they fail to exploit sufficient supervision for their models. To this end, we propose a dual student framework with trustworthy progressive learning (DuPL). Specifically, we propose a dual student network with a discrepancy loss to yield diverse CAMs for each sub-net. The two sub-nets generate supervision for each other, mitigating the confirmation bias caused by learning their own incorrect pseudo-labels. In this process, we progressively introduce more trustworthy pseudo-labels into the supervision through dynamic threshold adjustment with an adaptive noise filtering strategy. Moreover, we believe that every pixel, even one discarded from supervision due to its unreliability, is important for WSSS. Thus, we develop consistency regularization on these discarded regions, providing supervision for every pixel. Experimental results demonstrate the superiority of the proposed DuPL over recent state-of-the-art alternatives on the PASCAL VOC 2012 and MS COCO datasets. Code is available at https://github.com/Wu0409/DuPL.
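The cross-supervision idea in the abstract can be illustrated with a small sketch. This is not the authors' implementation (see the linked repository for that); it only shows, under stated assumptions, how two sub-nets might each take thresholded pseudo-labels from the other while the confidence threshold is relaxed over training. The function names, the `IGNORE` marker, the linear schedule, and the 0.9 and 0.5 endpoints are all hypothetical choices made for illustration.

```python
IGNORE = -1  # hypothetical marker for pixels excluded from supervision


def threshold_at(step, total_steps, start=0.9, end=0.5):
    """Relax the confidence threshold linearly from start to end.

    The linear schedule and its endpoints are assumptions for this sketch.
    """
    frac = min(max(step / total_steps, 0.0), 1.0)
    return start + (end - start) * frac


def pseudo_labels(probs, threshold):
    """Turn per-pixel class probabilities into labels.

    probs is a list of pixels, each a list of class probabilities.
    A pixel whose top probability is below the threshold is marked IGNORE.
    """
    labels = []
    for pixel in probs:
        best = max(range(len(pixel)), key=pixel.__getitem__)
        labels.append(best if pixel[best] >= threshold else IGNORE)
    return labels


def cross_supervision(probs_a, probs_b, step, total_steps):
    """Each student receives targets derived from the other student's output."""
    t = threshold_at(step, total_steps)
    targets_for_a = pseudo_labels(probs_b, t)
    targets_for_b = pseudo_labels(probs_a, t)
    return targets_for_a, targets_for_b
```

In this toy setup, early steps keep only highly confident pixels and later steps admit more of them, which mirrors the "progressively introduce more trustworthy pseudo-labels" description. The pixels marked `IGNORE` correspond to the discarded regions on which the abstract says a consistency regularization is applied; that part is not sketched here.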

Yuanchen Wu, Xichen Ye, Kequan Yang, Jide Li, Xiaoqiang Li • 2024

Related benchmarks

Task                                      Dataset                    Result          Rank
Semantic segmentation                     PASCAL VOC 2012 (val)      Mean IoU 73.3   2040
Semantic segmentation                     PASCAL VOC 2012 (test)     mIoU 72.8       1342
Semantic segmentation                     COCO 2014 (val)            mIoU 44.6       251
Weakly supervised semantic segmentation   PASCAL VOC 2012 (val)      mIoU 74.1       154
Semantic segmentation                     COCO (val)                 mIoU 44.6       135
Weakly supervised semantic segmentation   PASCAL VOC 2012 (train)    --              53
CAM seed generation                       Pascal VOC (train)         mIoU 75         19
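The results above are reported as mean Intersection over Union (mIoU). As a reminder of what that metric measures, here is a minimal sketch over flat label lists. It assumes an ignore label of 255 for unlabeled pixels and averages only over classes that appear in the prediction or the ground truth. The function name and signature are illustrative, and benchmark evaluations normally accumulate the counts over a whole dataset rather than over a single image.

```python
def mean_iou(pred, target, num_classes, ignore_index=255):
    """Mean IoU over classes for two equal-length lists of pixel labels."""
    inter = [0] * num_classes  # pixels where prediction and target agree
    union = [0] * num_classes  # pixels where either one names the class
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabeled pixels do not count (assumed convention)
        if p == t:
            inter[t] += 1
            union[t] += 1
        else:
            union[p] += 1
            union[t] += 1
    # Skip classes that never occur, to avoid dividing by zero.
    ious = [i / u for i, u in zip(inter, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0
```

For example, `mean_iou([0, 0, 1, 1], [0, 1, 1, 255], num_classes=2)` gives 0.5: each class has one correctly labeled pixel out of a union of two.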
