Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation

About

Test-time adaptation approaches have recently emerged as a practical solution for handling domain shift without access to the source domain data. In this paper, we propose and explore a new multi-modal extension of test-time adaptation for 3D semantic segmentation. We find that directly applying existing methods usually results in performance instability at test time because multi-modal input is not considered jointly. To design a framework that can take full advantage of multi-modality, where each modality provides regularized self-supervisory signals to other modalities, we propose two complementary modules within and across the modalities. First, Intra-modal Pseudolabel Generation (Intra-PG) is introduced to obtain reliable pseudo labels within each modality by aggregating information from two models that are both pre-trained on source data but updated with target data at different paces. Second, Inter-modal Pseudo-label Refinement (Inter-PR) adaptively selects more reliable pseudo labels from different modalities based on a proposed consistency scheme. Experiments demonstrate that our regularized pseudo labels produce stable self-learning signals in numerous multi-modal test-time adaptation scenarios for 3D semantic segmentation. Visit our project website at https://www.nec-labs.com/~mas/MM-TTA.

Inkyu Shin, Yi-Hsuan Tsai, Bingbing Zhuang, Samuel Schulter, Buyu Liu, Sparsh Garg, In So Kweon, Kuk-Jin Yoon• 2022

Related benchmarks

TaskDatasetResultRank
3D Semantic SegmentationnuScenes Day to Night (target)
mIoU 3D43.7
34
3D Semantic SegmentationA2D2 to SemanticKITTI (target)--
22
3D Semantic SegmentationSynthia to SemanticKITTI (target)
2D Score31.5
12
Showing 3 of 3 rows

Other info

Code

Follow for update