Rethinking Few-shot 3D Point Cloud Semantic Segmentation
About
This paper revisits few-shot 3D point cloud semantic segmentation (FS-PCS), with a focus on two significant issues in the state-of-the-art: foreground leakage and sparse point distribution. The former arises from non-uniform point sampling, allowing models to distinguish the density disparities between foreground and background for easier segmentation. The latter results from sampling only 2,048 points, limiting semantic information and deviating from the real-world practice. To address these issues, we introduce a standardized FS-PCS setting, upon which a new benchmark is built. Moreover, we propose a novel FS-PCS model. While previous methods are based on feature optimization by mainly refining support features to enhance prototypes, our method is based on correlation optimization, referred to as Correlation Optimization Segmentation (COSeg). Specifically, we compute Class-specific Multi-prototypical Correlation (CMC) for each query point, representing its correlations to category prototypes. Then, we propose the Hyper Correlation Augmentation (HCA) module to enhance CMC. Furthermore, tackling the inherent property of few-shot training to incur base susceptibility for models, we propose to learn non-parametric prototypes for the base classes during training. The learned base prototypes are used to calibrate correlations for the background class through a Base Prototypes Calibration (BPC) module. Experiments on popular datasets demonstrate the superiority of COSeg over existing methods. The code is available at: https://github.com/ZhaochongAn/COSeg
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot 3D Scene Segmentation | ScanNet Avg | mIoU46.5 | 61 | |
| Few-shot 3D Scene Segmentation | ScanNet S0 | mIoU49.43 | 60 | |
| Few-shot 3D Scene Segmentation | ScanNet S1 | mIoU43.57 | 60 | |
| Few-shot 3D Point Cloud Semantic Segmentation | S3DIS v1.2 (Area 5) | mIoU48.72 | 40 | |
| 3D Semantic Segmentation | ScanNet S0 | mIoU48.54 | 36 | |
| 3D Point Cloud Semantic Segmentation | ScanNet official (fold S1) | mIoU44.68 | 24 | |
| 3D Point Cloud Semantic Segmentation | ScanNet Mean Fold official | mIoU46.61 | 24 | |
| Few-shot 3D Point Cloud Semantic Segmentation | ScanNet V2 | mIoU (S0)48.54 | 24 | |
| Few-shot 3D Point Cloud Semantic Segmentation | S3DIS (Mean across folds) | mIoU48.6 | 20 | |
| Semantic segmentation | ScanNet++ new (test) | mIoU (Boundary)59.34 | 12 |