Coarse-to-Fine Domain Adaptive Semantic Segmentation with Photometric Alignment and Category-Center Regularization

About

Unsupervised domain adaptation (UDA) in semantic segmentation is a fundamental yet promising task relieving the need for laborious annotation works. However, the domain shifts/discrepancies problem in this task compromise the final segmentation performance. Based on our observation, the main causes of the domain shifts are differences in imaging conditions, called image-level domain shifts, and differences in object category configurations called category-level domain shifts. In this paper, we propose a novel UDA pipeline that unifies image-level alignment and category-level feature distribution regularization in a coarse-to-fine manner. Specifically, on the coarse side, we propose a photometric alignment module that aligns an image in the source domain with a reference image from the target domain using a set of image-level operators; on the fine side, we propose a category-oriented triplet loss that imposes a soft constraint to regularize category centers in the source domain and a self-supervised consistency regularization method in the target domain. Experimental results show that our proposed pipeline improves the generalization capability of the final segmentation model and significantly outperforms all previous state-of-the-arts.

Haoyu Ma, Xiangru Lin, Zifeng Wu, Yizhou Yu• 2021

Related benchmarks

Task	Dataset	Result
Semantic segmentation	GTA5 → Cityscapes (val)	mIoU56.1	586
Semantic segmentation	SYNTHIA to Cityscapes (val)	Rider IoU33.2	480
Semantic segmentation	SYNTHIA to Cityscapes	Road IoU75.7	159
Semantic segmentation	Synthia to Cityscapes (test)	Road IoU75.7	138
Semantic segmentation	GTA5 to Cityscapes 1.0 (val)	Road IoU92.5	98
Semantic segmentation	GTA to Cityscapes	Road IoU92.5	72
Semantic segmentation	Cityscapes (val)	Road IoU75.7	29

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord