PixMatch: Unsupervised Domain Adaptation via Pixelwise Consistency Training
About
Unsupervised domain adaptation is a promising technique for semantic segmentation and other computer vision tasks for which large-scale data annotation is costly and time-consuming. In semantic segmentation, it is attractive to train models on annotated images from a simulated (source) domain and deploy them on real (target) domains. In this work, we present a novel framework for unsupervised domain adaptation based on the notion of target-domain consistency training. Intuitively, our work is based on the idea that in order to perform well on the target domain, a model's output should be consistent with respect to small perturbations of inputs in the target domain. Specifically, we introduce a new loss term to enforce pixelwise consistency between the model's predictions on a target image and a perturbed version of the same image. In comparison to popular adversarial adaptation methods, our approach is simpler, easier to implement, and more memory-efficient during training. Experiments and extensive ablation studies demonstrate that our simple approach achieves remarkably strong results on two challenging synthetic-to-real benchmarks, GTA5-to-Cityscapes and SYNTHIA-to-Cityscapes. Code is available at: https://github.com/lukemelas/pixmatch
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | GTA5 → Cityscapes (val) | mIoU50.3 | 533 | |
| Semantic segmentation | SYNTHIA to Cityscapes (val) | Rider IoU85.7 | 435 | |
| Semantic segmentation | SYNTHIA to Cityscapes | Road IoU92.5 | 150 | |
| Semantic segmentation | GTA to Cityscapes | Road IoU91.6 | 72 | |
| Semantic segmentation | Cityscapes (val) | IoU (road)91.6 | 37 | |
| Semantic segmentation | Cityscapes (val) | Road IoU92.5 | 29 | |
| Open-Set Domain Adaptation Semantic Segmentation | GTA5 → Cityscapes (test) | Road79.27 | 17 | |
| Semantic segmentation | SYNTHIA-Seq to Cityscapes-Seq (val) | Road IoU90.2 | 14 | |
| Video Semantic Segmentation | VIPER -> Cityscapes-Seq (val) | Road IoU79.4 | 14 | |
| Open-Set Domain Adaptation Semantic Segmentation | SYNTHIA → Cityscapes (test) | Road IoU74.16 | 13 |