Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rethinking Multimodal Few-Shot 3D Point Cloud Segmentation: From Fused Refinement to Decoupled Arbitration

About

In this paper, we revisit multimodal few-shot 3D point cloud semantic segmentation (FS-PCS), identifying a conflict in "Fuse-then-Refine" paradigms: the "Plasticity-Stability Dilemma." In addition, CLIP's inter-class confusion can result in semantic blindness. To address these issues, we present the Decoupled-experts Arbitration Few-Shot SegNet (DA-FSS), a model that effectively distinguishes between semantic and geometric paths and mutually regularizes their gradients to achieve better generalization. DA-FSS employs the same backbone and pre-trained text encoder as MM-FSS to generate text embeddings, which can increase free modalities' utilization rate and better leverage each modality's information space. To achieve this, we propose a Parallel Expert Refinement module to generate each modal correlation. We also propose a Stacked Arbitration Module (SAM) to perform convolutional fusion and arbitrate correlations for each modality pathway. The Parallel Experts decouple two paths: a Geometric Expert maintains plasticity, and a Semantic Expert ensures stability. They are coordinated via a Decoupled Alignment Module (DAM) that transfers knowledge without propagating confusion. Experiments on popular datasets (S3DIS, ScanNet) demonstrate the superiority of DA-FSS over MM-FSS. Meanwhile, geometric boundaries, completeness, and texture differentiation are all superior to the baseline. The code is available at: https://github.com/MoWenQAQ/DA-FSS/.

Wentao Bian, Fenglei Xu• 2026

Related benchmarks

TaskDatasetResultRank
Few-shot 3D Point Cloud Semantic SegmentationS3DIS v1.2 (Area 5)
mIoU (S0)51.9
56
3D Semantic SegmentationScanNet S0
mIoU55.4
36
3D Point Cloud Semantic SegmentationScanNet official (fold S1)
mIoU46.3
24
3D Point Cloud Semantic SegmentationScanNet Mean Fold official
mIoU50.85
24
Few-shot 3D Point Cloud Semantic SegmentationScanNet V2
mIoU (S0)55.4
24
Showing 5 of 5 rows

Other info

Follow for update