All about Structure: Adapting Structural Information across Domains for Boosting Semantic Segmentation
About
In this paper we tackle the problem of unsupervised domain adaptation for the task of semantic segmentation, where we attempt to transfer the knowledge learned upon synthetic datasets with ground-truth labels to real-world images without any annotation. With the hypothesis that the structural content of images is the most informative and decisive factor to semantic segmentation and can be readily shared across domains, we propose a Domain Invariant Structure Extraction (DISE) framework to disentangle images into domain-invariant structure and domain-specific texture representations, which can further realize image-translation across domains and enable label transfer to improve segmentation performance. Extensive experiments verify the effectiveness of our proposed DISE model and demonstrate its superiority over several state-of-the-art approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | SYNTHIA to Cityscapes (val) | Rider IoU19.2 | 435 | |
| Semantic segmentation | Cityscapes GTA5 to Cityscapes adaptation (val) | mIoU (Overall)45.4 | 352 | |
| Semantic segmentation | GTA5 to Cityscapes (test) | mIoU45.4 | 151 | |
| Semantic segmentation | Synthia to Cityscapes (test) | Road IoU91.7 | 138 | |
| Semantic segmentation | Cityscapes adaptation from Synthia 1.0 (val) | Person IoU55.8 | 114 | |
| Semantic segmentation | Cityscapes 1.0 (val) | mIoU45.4 | 110 | |
| Semantic segmentation | Foggy Driving (FD) (test) | mIoU45.2 | 56 | |
| Semantic segmentation | Foggy Zurich (test) | mIoU40.7 | 51 | |
| Semantic segmentation | SYNTHIA-to-Cityscapes (SYN2CS) 16 classes (val) | IoU41.5 | 50 |