Class-Balanced Pixel-Level Self-Labeling for Domain Adaptive Semantic Segmentation

About

Domain adaptive semantic segmentation aims to learn a model with the supervision of source domain data, and produce satisfactory dense predictions on unlabeled target domain. One popular solution to this challenging task is self-training, which selects high-scoring predictions on target samples as pseudo labels for training. However, the produced pseudo labels often contain much noise because the model is biased to source domain as well as majority categories. To address the above issues, we propose to directly explore the intrinsic pixel distributions of target domain data, instead of heavily relying on the source domain. Specifically, we simultaneously cluster pixels and rectify pseudo labels with the obtained cluster assignments. This process is done in an online fashion so that pseudo labels could co-evolve with the segmentation model without extra training rounds. To overcome the class imbalance problem on long-tailed categories, we employ a distribution alignment technique to enforce the marginal class distribution of cluster assignments to be close to that of pseudo labels. The proposed method, namely Class-balanced Pixel-level Self-Labeling (CPSL), improves the segmentation performance on target domain over state-of-the-arts by a large margin, especially on long-tailed categories.

Ruihuang Li, Shuai Li, Chenhang He, Yabin Zhang, Xu Jia, Lei Zhang• 2022

Related benchmarks

Task	Dataset	Result
Semantic segmentation	GTA5 → Cityscapes (val)	mIoU60.8	586
Semantic segmentation	SYNTHIA to Cityscapes (val)	Rider IoU32	480
Semantic segmentation	SYNTHIA to Cityscapes	Road IoU87.2	159
Semantic segmentation	GTA5 to Cityscapes (test)	mIoU61.7	151
Semantic segmentation	Synthia to Cityscapes (test)	--	138
Semantic segmentation	GTA5 to Cityscapes 1.0 (val)	Road IoU92.3	98
Semantic segmentation	SYNTHIA-to-Cityscapes 16 categories (val)	mIoU (Overall)54.4	74
Semantic segmentation	Cityscapes trained on SYNTHIA (val)	Road IoU87.2	60
Semantic segmentation	GTA5 to Cityscapes	mIoU60.8	58
Semantic segmentation	GTA to Cityscapes (val)	Road Accuracy92.3	55

Showing 10 of 11 rows

Other info

Code

Follow for update

@wizwand_team Discord