Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation
About
Unsupervised domain adaptation (UDA) aims to enhance the generalization capability of a certain model from a source domain to a target domain. UDA is of particular significance since no extra effort is devoted to annotating target domain samples. However, the different data distributions in the two domains, or \emph{domain shift/discrepancy}, inevitably compromise the UDA performance. Although there has been a progress in matching the marginal distributions between two domains, the classifier favors the source domain features and makes incorrect predictions on the target domain due to category-agnostic feature alignment. In this paper, we propose a novel category anchor-guided (CAG) UDA model for semantic segmentation, which explicitly enforces category-aware feature alignment to learn shared discriminative features and classifiers simultaneously. First, the category-wise centroids of the source domain features are used as guided anchors to identify the active features in the target domain and also assign them pseudo-labels. Then, we leverage an anchor-based pixel-level distance loss and a discriminative loss to drive the intra-category features closer and the inter-category features further apart, respectively. Finally, we devise a stagewise training mechanism to reduce the error accumulation and adapt the proposed model progressively. Experiments on both the GTA5$\rightarrow $Cityscapes and SYNTHIA$\rightarrow $Cityscapes scenarios demonstrate the superiority of our CAG-UDA model over the state-of-the-art methods. The code is available at \url{https://github.com/RogerZhangzz/CAG_UDA}.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | GTA5 → Cityscapes (val) | mIoU50.2 | 533 | |
| Semantic segmentation | SYNTHIA to Cityscapes (val) | Rider IoU81.8 | 435 | |
| Semantic segmentation | Cityscapes GTA5 to Cityscapes adaptation (val) | mIoU (Overall)50.2 | 352 | |
| Semantic segmentation | GTA5 to Cityscapes (test) | mIoU51.7 | 151 | |
| Semantic segmentation | SYNTHIA to Cityscapes | Road IoU84.8 | 150 | |
| Semantic segmentation | Synthia to Cityscapes (test) | Road IoU84.7 | 138 | |
| Semantic segmentation | Cityscapes (val) | mIoU50.2 | 133 | |
| Semantic segmentation | GTA5 to Cityscapes 1.0 (val) | Road IoU90.4 | 98 | |
| Semantic segmentation | GTA to Cityscapes | Road IoU90.4 | 72 | |
| Semantic segmentation | Cityscapes GTA5 source 1.0 (val) | mIoU50.2 | 49 |