Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Open-Vocabulary Domain Generalization in Urban-Scene Segmentation

About

Domain Generalization in Semantic Segmentation (DG-SS) aims to enable segmentation models to perform robustly in unseen environments. However, conventional DG-SS methods are restricted to a fixed set of known categories, limiting their applicability in open-world scenarios. Recent progress in Vision-Language Models (VLMs) has advanced Open-Vocabulary Semantic Segmentation (OV-SS) by enabling models to recognize a broader range of concepts. Yet, these models remain sensitive to domain shifts and struggle to maintain robustness when deployed in unseen environments, a challenge that is particularly severe in urban-driving scenarios. To bridge this gap, we introduce Open-Vocabulary Domain Generalization in Semantic Segmentation (OVDG-SS), a new setting that jointly addresses unseen domains and unseen categories. We introduce the first benchmark for OVDG-SS in autonomous driving, addressing a previously unexplored problem and covering both synthetic-to-real and real-to-real generalization across diverse unseen domains and unseen categories. In OVDG-SS, we observe that domain shifts often distort text-image correlations in pre-trained VLMs, which hinders the performance of OV-SS models. To tackle this challenge, we propose S2-Corr, a state-space-driven text-image correlation refinement mechanism that mitigates domain-induced distortions and produces more consistent text-image correlations under distribution changes. Extensive experiments on our constructed benchmark demonstrate that the proposed method achieves superior cross-domain performance and efficiency compared to existing OV-SS approaches.

Dong Zhao, Qi Zang, Nan Pu, Wenjing Li, Nicu Sebe, Zhun Zhong• 2026

Related benchmarks

TaskDatasetResultRank
Semantic segmentationDv 19-class (val)
ACDC-19 Score54.3
46
Semantic segmentationDv 58-class (val)
ACDC-4162
46
Semantic segmentationMapillary Vistas
mIoU67.6
22
Semantic segmentationDv-19 Synthetic-to-Real (test)
CS-19 Score52.2
16
Semantic segmentationDv-58 Synthetic-to-Real (test)
ACDC-41 Score55.4
16
Semantic segmentationBDD100K BDD-19
mIoU61.8
7
Semantic segmentationCityscapes CS-19
mIoU65.2
7
Open-Set Semantic SegmentationFS Static
IoU74.6
5
Open-Set Semantic SegmentationLost & Found FS
IoU36.3
5
Open-Set Semantic SegmentationSMIYC Anomaly
IoU82.9
5
Showing 10 of 16 rows

Other info

Follow for update