xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
About
Unsupervised Domain Adaptation (UDA) is crucial to tackle the lack of annotations in a new domain. There are many multi-modal datasets, but most UDA approaches are uni-modal. In this work, we explore how to learn from multi-modality and propose cross-modal UDA (xMUDA) where we assume the presence of 2D images and 3D point clouds for 3D semantic segmentation. This is challenging as the two input spaces are heterogeneous and can be impacted differently by domain shift. In xMUDA, modalities learn from each other through mutual mimicking, disentangled from the segmentation objective, to prevent the stronger modality from adopting false predictions from the weaker one. We evaluate on new UDA scenarios including day-to-night, country-to-country and dataset-to-dataset, leveraging recent autonomous driving datasets. xMUDA brings large improvements over uni-modal UDA on all tested scenarios, and is complementary to state-of-the-art UDA techniques. Code is available at https://github.com/valeoai/xmuda.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Semantic Segmentation | nuScenes Day to Night (target) | mIoU 3D46.7 | 34 | |
| 3D Semantic Segmentation | A2D2 to SemanticKITTI (target) | -- | 22 | |
| Out-of-distribution segmentation | nuScenes | FPR@9544.32 | 16 | |
| 3D Semantic Segmentation | Synthia to SemanticKITTI (target) | 2D Score25.6 | 12 | |
| 3D Multimodal Panoptic Segmentation | nuScenes Day to Night v1.0 (target-unlabeled) | PQ69.7 | 9 | |
| 3D Multimodal Panoptic Segmentation | nuScenes USA to Singapore v1.0 (target-unlabeled) | Panoptic Quality (PQ)67.2 | 9 | |
| 3D Multimodal Panoptic Segmentation | SemanticKITTI to nuScenes (target-unlabeled) | PQ49.1 | 9 | |
| 3D Multimodal Panoptic Segmentation | nuScenes Sunny to Rainy v1.0 (target-unlabeled) | PQ62.2 | 9 | |
| Out-of-distribution segmentation | CARLA OOD | FPR@9597 | 8 | |
| Out-of-distribution segmentation | SemanticKITTI | FPR@9555.37 | 8 |