Coarse-to-Fine Domain Adaptive Semantic Segmentation with Photometric Alignment and Category-Center Regularization
About
Unsupervised domain adaptation (UDA) in semantic segmentation is a fundamental yet promising task relieving the need for laborious annotation works. However, the domain shifts/discrepancies problem in this task compromise the final segmentation performance. Based on our observation, the main causes of the domain shifts are differences in imaging conditions, called image-level domain shifts, and differences in object category configurations called category-level domain shifts. In this paper, we propose a novel UDA pipeline that unifies image-level alignment and category-level feature distribution regularization in a coarse-to-fine manner. Specifically, on the coarse side, we propose a photometric alignment module that aligns an image in the source domain with a reference image from the target domain using a set of image-level operators; on the fine side, we propose a category-oriented triplet loss that imposes a soft constraint to regularize category centers in the source domain and a self-supervised consistency regularization method in the target domain. Experimental results show that our proposed pipeline improves the generalization capability of the final segmentation model and significantly outperforms all previous state-of-the-arts.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | GTA5 → Cityscapes (val) | mIoU56.1 | 533 | |
| Semantic segmentation | SYNTHIA to Cityscapes (val) | Rider IoU33.2 | 435 | |
| Semantic segmentation | SYNTHIA to Cityscapes | Road IoU75.7 | 150 | |
| Semantic segmentation | Synthia to Cityscapes (test) | Road IoU75.7 | 138 | |
| Semantic segmentation | GTA5 to Cityscapes 1.0 (val) | Road IoU92.5 | 98 | |
| Semantic segmentation | GTA to Cityscapes | Road IoU92.5 | 72 | |
| Semantic segmentation | Cityscapes (val) | Road IoU75.7 | 29 |