Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation
About
Weakly supervised semantic segmentation is typically inspired by class activation maps, which serve as pseudo masks with class-discriminative regions highlighted. Although tremendous efforts have been made to recall precise and complete locations for each class, existing methods still commonly suffer from unsolicited Out-of-Candidate (OC) error predictions, i.e., predictions that do not belong to the label candidates. Such errors are avoidable, since their contradiction with the image-level class tags is easy to detect. In this paper, we develop a group ranking-based Out-of-Candidate Rectification (OCR) mechanism in a plug-and-play fashion. First, we adaptively split the semantic categories into In-Candidate (IC) and OC groups for each OC pixel according to their prior annotation correlation and posterior prediction correlation. Then, we derive a differentiable rectification loss to force OC pixels to shift to the IC group. Incorporating our OCR with seminal baselines (e.g., AffinityNet, SEAM, MCTformer), we achieve remarkable performance gains on both Pascal VOC (+3.2%, +3.3%, +0.8% mIoU) and MS COCO (+1.0%, +1.3%, +0.5% mIoU) datasets with negligible extra training overhead, which justifies the effectiveness and generality of our OCR.
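To make the two steps concrete, the following is a minimal NumPy sketch and not the paper's implementation. It rests on simplifying assumptions: the IC group is taken to be exactly the tagged classes and the OC group all remaining classes (the adaptive split by prior and posterior correlation is omitted), and the loss is a generic pairwise group-ranking form, log(1 + sum of exp(s_oc - s_ic + margin)), chosen for illustration. The function names `find_oc_pixels` and `group_ranking_loss` and the `margin` parameter are hypothetical.

```python
import numpy as np


def find_oc_pixels(logits, tags):
    """Flag pixels whose top-scoring class is absent from the image-level tags.

    logits: float array of shape (C, H, W), per-class scores.
    tags:   bool array of shape (C,), True for classes present in the image.
    Returns a bool array of shape (H, W), True at OC pixels.
    """
    pred = logits.argmax(axis=0)      # (H, W) predicted class index
    return ~tags[pred]                # True where the prediction contradicts the tags


def group_ranking_loss(logits, tags, margin=0.0):
    """Illustrative ranking loss pushing IC scores above OC scores at OC pixels.

    Assumption: IC group = tagged classes, OC group = untagged classes.
    """
    oc_mask = find_oc_pixels(logits, tags)
    if not oc_mask.any():
        return 0.0                    # nothing to rectify

    scores = logits[:, oc_mask]       # (C, N) scores at the N OC pixels
    ic = scores[tags]                 # (n_ic, N) in-candidate scores
    oc = scores[~tags]                # (n_oc, N) out-of-candidate scores

    # All (OC class, IC class) score gaps per pixel: (n_oc * n_ic, N)
    diff = (oc[:, None, :] - ic[None, :, :] + margin).reshape(-1, scores.shape[1])

    # log(1 + sum(exp(diff))) computed stably as a log-sum-exp with an extra 0 term
    stacked = np.concatenate([np.zeros((1, diff.shape[1])), diff], axis=0)
    peak = stacked.max(axis=0)
    per_pixel = peak + np.log(np.exp(stacked - peak).sum(axis=0))
    return float(per_pixel.mean())


# Toy example: 3 classes, a 1x2 image, class 1 is NOT in the image-level tags.
tags = np.array([True, False, True])
logits = np.array([[[2.0, 0.0]],
                   [[0.0, 3.0]],
                   [[1.0, 1.0]]])    # second pixel predicts the untagged class 1
oc_mask = find_oc_pixels(logits, tags)
loss = group_ranking_loss(logits, tags)
```

In this toy input only the second pixel is flagged, and the loss shrinks as the untagged class's score at that pixel falls below the tagged classes' scores. Because the expression is smooth in the logits, the same form could be written in an autodiff framework and added to a baseline's training objective.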
Related benchmarks
| Task | Dataset | Result | Rank |
|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (val) | Mean IoU 72.7 | 2040 |
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU 72 | 1342 |
| Semantic segmentation | PASCAL VOC (val) | mIoU 72.7 | 338 |
| Semantic segmentation | COCO 2014 (val) | mIoU 42.5 | 251 |
| Semantic segmentation | Pascal VOC (test) | mIoU 72 | 236 |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (test) | mIoU 72 | 158 |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (val) | mIoU 72.7 | 154 |
| Semantic segmentation | COCO (val) | mIoU 42.5 | 135 |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (train) | mIoU (Mask) 69.1 | 53 |