Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation

About

In this paper, we study the task of synthetic-to-real domain generalized semantic segmentation, which aims to learn a model that is robust to unseen real-world scenes using only synthetic data. The large domain shift between synthetic and real-world data, including the limited source environmental variations and the large distribution gap between synthetic and real-world data, significantly hinders the model performance on unseen real-world scenes. In this work, we propose the Style-HAllucinated Dual consistEncy learning (SHADE) framework to handle such domain shift. Specifically, SHADE is constructed based on two consistency constraints, Style Consistency (SC) and Retrospection Consistency (RC). SC enriches the source situations and encourages the model to learn consistent representation across style-diversified samples. RC leverages real-world knowledge to prevent the model from overfitting to synthetic data and thus largely keeps the representation consistent between the synthetic and real-world models. Furthermore, we present a novel style hallucination module (SHM) to generate style-diversified samples that are essential to consistency learning. SHM selects basis styles from the source distribution, enabling the model to dynamically generate diverse and realistic samples during training. Experiments show that our SHADE yields significant improvement and outperforms state-of-the-art methods by 5.05% and 8.35% on the average mIoU of three real-world datasets on single- and multi-source settings, respectively.

Yuyang Zhao, Zhun Zhong, Na Zhao, Nicu Sebe, Gim Hee Lee• 2022

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCityscapes (test)
mIoU47.43
1145
Semantic segmentationCityscapes (val)
mIoU46.66
332
Semantic segmentationMapillary (val)
mIoU55
153
Semantic segmentationCityscapes 1.0 (val)
mIoU44.65
110
Semantic segmentationBDD-100K (val)
mIoU43.66
102
Semantic segmentationCityScapes, BDD, and Mapillary (val)
Mean mIoU45.3
85
Semantic segmentationBDD100K
mIoU50.95
78
Semantic segmentationMapillary
mIoU60.67
75
Semantic segmentationBDD100K (val)
mIoU48.2
72
Semantic segmentationMapillary Vistas (val)
mIoU45.5
72
Showing 10 of 39 rows

Other info

Code

Follow for update