Open-Vocabulary Domain Generalization in Urban-Scene Segmentation
About
Domain Generalization in Semantic Segmentation (DG-SS) aims to enable segmentation models to perform robustly in unseen environments. However, conventional DG-SS methods are restricted to a fixed set of known categories, limiting their applicability in open-world scenarios. Recent progress in Vision-Language Models (VLMs) has advanced Open-Vocabulary Semantic Segmentation (OV-SS) by enabling models to recognize a broader range of concepts. Yet, these models remain sensitive to domain shifts and struggle to maintain robustness when deployed in unseen environments, a challenge that is particularly severe in urban-driving scenarios. To bridge this gap, we introduce Open-Vocabulary Domain Generalization in Semantic Segmentation (OVDG-SS), a new setting that jointly addresses unseen domains and unseen categories. We introduce the first benchmark for OVDG-SS in autonomous driving, addressing a previously unexplored problem and covering both synthetic-to-real and real-to-real generalization across diverse unseen domains and unseen categories. In OVDG-SS, we observe that domain shifts often distort text-image correlations in pre-trained VLMs, which hinders the performance of OV-SS models. To tackle this challenge, we propose S2-Corr, a state-space-driven text-image correlation refinement mechanism that mitigates domain-induced distortions and produces more consistent text-image correlations under distribution changes. Extensive experiments on our constructed benchmark demonstrate that the proposed method achieves superior cross-domain performance and efficiency compared to existing OV-SS approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | Dv 19-class (val) | ACDC-19 Score54.3 | 46 | |
| Semantic segmentation | Dv 58-class (val) | ACDC-4162 | 46 | |
| Semantic segmentation | Mapillary Vistas | mIoU67.6 | 22 | |
| Semantic segmentation | Dv-19 Synthetic-to-Real (test) | CS-19 Score52.2 | 16 | |
| Semantic segmentation | Dv-58 Synthetic-to-Real (test) | ACDC-41 Score55.4 | 16 | |
| Semantic segmentation | BDD100K BDD-19 | mIoU61.8 | 7 | |
| Semantic segmentation | Cityscapes CS-19 | mIoU65.2 | 7 | |
| Open-Set Semantic Segmentation | FS Static | IoU74.6 | 5 | |
| Open-Set Semantic Segmentation | Lost & Found FS | IoU36.3 | 5 | |
| Open-Set Semantic Segmentation | SMIYC Anomaly | IoU82.9 | 5 |