Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation
About
Domain adaptive semantic segmentation aims to learn a model with the supervision of source domain data, and produce satisfactory dense predictions on unlabeled target domain. One popular solution to this challenging task is self-training, which selects high-scoring predictions on target samples as pseudo labels for training. However, the produced pseudo labels often contain much noise because the model is biased to source domain as well as majority categories. To address the above issues, we propose to directly explore the intrinsic pixel distributions of target domain data, instead of heavily relying on the source domain. Specifically, we simultaneously cluster pixels and rectify pseudo labels with the obtained cluster assignments. This process is done in an online fashion so that pseudo labels could co-evolve with the segmentation model without extra training rounds. To overcome the class imbalance problem on long-tailed categories, we employ a distribution alignment technique to enforce the marginal class distribution of cluster assignments to be close to that of pseudo labels. The proposed method, namely Class-balanced Pixel-level Self-Labeling (CPSL), improves the segmentation performance on target domain over state-of-the-arts by a large margin, especially on long-tailed categories.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | GTA5 → Cityscapes (val) | mIoU60.8 | 533 | |
| Semantic segmentation | SYNTHIA to Cityscapes (val) | Rider IoU32 | 435 | |
| Semantic segmentation | GTA5 to Cityscapes (test) | mIoU61.7 | 151 | |
| Semantic segmentation | SYNTHIA to Cityscapes | Road IoU87.2 | 150 | |
| Semantic segmentation | Synthia to Cityscapes (test) | -- | 138 | |
| Semantic segmentation | GTA5 to Cityscapes 1.0 (val) | Road IoU92.3 | 98 | |
| Semantic segmentation | SYNTHIA-to-Cityscapes 16 categories (val) | mIoU (Overall)54.4 | 74 | |
| Semantic segmentation | Cityscapes trained on SYNTHIA (val) | Road IoU87.2 | 60 | |
| Semantic segmentation | GTA5 to Cityscapes | mIoU60.8 | 58 | |
| Semantic segmentation | GTA to Cityscapes (val) | Road Accuracy92.3 | 44 |