Causal Unsupervised Semantic Segmentation
About
Unsupervised semantic segmentation aims to achieve high-quality semantic grouping without human-labeled annotations. With the advent of self-supervised pre-training, various frameworks utilize the pre-trained features to train prediction heads for unsupervised dense prediction. However, a significant challenge in this unsupervised setup is determining the appropriate level of clustering required for segmenting concepts. To address it, we propose a novel framework, CAusal Unsupervised Semantic sEgmentation (CAUSE), which leverages insights from causal inference. Specifically, we bridge intervention-oriented approach (i.e., frontdoor adjustment) to define suitable two-step tasks for unsupervised prediction. The first step involves constructing a concept clusterbook as a mediator, which represents possible concept prototypes at different levels of granularity in a discretized form. Then, the mediator establishes an explicit link to the subsequent concept-wise self-supervised learning for pixel-level grouping. Through extensive experiments and analyses on various datasets, we corroborate the effectiveness of CAUSE and achieve state-of-the-art performance in unsupervised semantic segmentation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | PASCAL VOC 2012 (test) | mIoU53.3 | 1342 | |
| Semantic segmentation | Cityscapes-C (val) | mIoU28 | 56 | |
| 2D Ephemeral Object Segmentation | Mapverse-Ithaca365 1.0 (test) | mIoU20.88 | 20 | |
| Semantic segmentation | Cityscapes 27 Classes (test) | Accuracy94.5 | 12 | |
| Semantic segmentation | COCO-Stuff mid-level 27 categories (test) | mIoU52.3 | 8 | |
| Unsupervised Semantic Segmentation | COCO-S-171 | mIoU15.2 | 8 | |
| Unsupervised Semantic Segmentation | COCO-81 | mIoU21.2 | 4 |