Causal Intervention for Weakly-Supervised Semantic Segmentation
About
We present a causal inference framework to improve Weakly-Supervised Semantic Segmentation (WSSS). Specifically, we aim to generate better pixel-level pseudo-masks by using only image-level labels -- the most crucial step in WSSS. We attribute the cause of the ambiguous boundaries of pseudo-masks to the confounding context, e.g., the correct image-level classification of "horse" and "person" may be not only due to the recognition of each instance, but also their co-occurrence context, making the model inspection (e.g., CAM) hard to distinguish between the boundaries. Inspired by this, we propose a structural causal model to analyze the causalities among images, contexts, and class labels. Based on it, we develop a new method: Context Adjustment (CONTA), to remove the confounding bias in image-level classification and thus provide better pseudo-masks as ground-truth for the subsequent segmentation model. On PASCAL VOC 2012 and MS-COCO, we show that CONTA boosts various popular WSSS methods to new state-of-the-arts.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (val) | Mean IoU66.1 | 2040 | |
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU66.7 | 1342 | |
| Semantic segmentation | PASCAL VOC (val) | mIoU66.1 | 338 | |
| Semantic segmentation | COCO 2014 (val) | mIoU33.4 | 251 | |
| Semantic segmentation | Pascal VOC (test) | mIoU66.7 | 236 | |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (test) | mIoU66.7 | 158 | |
| Weakly supervised semantic segmentation | PASCAL VOC 2012 (val) | mIoU66.1 | 154 | |
| Semantic segmentation | COCO (val) | mIoU33.4 | 135 | |
| Semantic segmentation | COCO Object (val) | mIoU0.334 | 77 | |
| Semantic segmentation | VOC 2012 (val) | mIoU66.1 | 67 |