DA-Cal: Towards Cross-Domain Calibration in Semantic Segmentation
About
While existing unsupervised domain adaptation (UDA) methods greatly enhance target domain performance in semantic segmentation, they often neglect network calibration quality, resulting in misalignment between prediction confidence and actual accuracy -- a significant risk in safety-critical applications. Our key insight emerges from observing that performance degrades substantially when soft pseudo-labels replace hard pseudo-labels in cross-domain scenarios due to poor calibration, despite the theoretical equivalence of perfectly calibrated soft pseudo-labels to hard pseudo-labels. Based on this finding, we propose DA-Cal, a dedicated cross-domain calibration framework that transforms target domain calibration into soft pseudo-label optimization. DA-Cal introduces a Meta Temperature Network to generate pixel-level calibration parameters and employs bi-level optimization to establish the relationship between soft pseudo-labels and UDA supervision, while utilizing complementary domain-mixing strategies to prevent overfitting and reduce domain discrepancies. Experiments demonstrate that DA-Cal seamlessly integrates with existing self-training frameworks across multiple UDA segmentation benchmarks, significantly improving target domain calibration while delivering performance gains without inference overhead. The code will be released.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Calibration for Unsupervised Domain Adaptation | GTAv to Cityscapes Source: Synthetic, Target: Real (test) | ECE4.59 | 21 | |
| Calibration for Unsupervised Domain Adaptation | Cityscapes to ACDC Normal weather to adverse weather (test) | ECE0.1056 | 21 | |
| Calibration for Unsupervised Domain Adaptation | SYNTHIA to Cityscapes 16 classes (test) | ECE0.0822 | 21 | |
| Semantic segmentation | MitoEM-R -> MitoEM-H v1 (test) | ECE0.0325 | 14 | |
| Semantic segmentation | VNC III -> Lucchi Subset1 (test) | ECE0.052 | 14 |