Learning Texture Invariant Representation for Domain Adaptation of Semantic Segmentation

About

Since annotating pixel-level labels for semantic segmentation is laborious, leveraging synthetic data is an attractive solution. However, due to the domain gap between synthetic domain and real domain, it is challenging for a model trained with synthetic data to generalize to real data. In this paper, considering the fundamental difference between the two domains as the texture, we propose a method to adapt to the texture of the target domain. First, we diversity the texture of synthetic images using a style transfer algorithm. The various textures of generated images prevent a segmentation model from overfitting to one specific (synthetic) texture. Then, we fine-tune the model with self-training to get direct supervision of the target texture. Our results achieve state-of-the-art performance and we analyze the properties of the model trained on the stylized dataset with extensive experiments.

Myeongjin Kim, Hyeran Byun• 2020

Related benchmarks

Task	Dataset	Result
Semantic segmentation	GTA5 → Cityscapes (val)	mIoU50.2	586
Semantic segmentation	SYNTHIA to Cityscapes (val)	Rider IoU52.6	480
Semantic segmentation	Cityscapes GTA5 to Cityscapes adaptation (val)	mIoU (Overall)50.2	360
Semantic segmentation	SYNTHIA to Cityscapes	Road IoU92.6	159
Semantic segmentation	Synthia to Cityscapes (test)	Road IoU92.6	138
Semantic segmentation	Cityscapes (val)	mIoU50.2	133
Semantic segmentation	GTA5 to Cityscapes 1.0 (val)	Road IoU92.9	98
Semantic segmentation	GTA to Cityscapes	Road IoU92.9	72
Semantic segmentation	Cityscapes trained on SYNTHIA (val)	Road IoU92.6	60
Semantic segmentation	Cityscapes GTA5 source 1.0 (val)	mIoU50.2	49

Showing 10 of 13 rows

Other info

Follow for update

@wizwand_team Discord