Few-shot 3D Point Cloud Semantic Segmentation
About
Many existing approaches for 3D point cloud semantic segmentation are fully supervised. These fully supervised approaches heavily rely on large amounts of labeled training data that are difficult to obtain and cannot segment new classes after training. To mitigate these limitations, we propose a novel attention-aware multi-prototype transductive few-shot point cloud semantic segmentation method to segment new classes given a few labeled examples. Specifically, each class is represented by multiple prototypes to model the complex data distribution of labeled points. Subsequently, we employ a transductive label propagation method to exploit the affinities between labeled multi-prototypes and unlabeled points, and among the unlabeled points. Furthermore, we design an attention-aware multi-level feature learning network to learn the discriminative features that capture the geometric dependencies and semantic correlations between points. Our proposed method shows significant and consistent improvements compared to baselines in different few-shot point cloud semantic segmentation settings (i.e., 2/3-way 1/5-shot) on two benchmark datasets. Our code is available at https://github.com/Na-Z/attMPTI.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Few-shot 3D Scene Segmentation | ScanNet Avg | mIoU52.16 | 61 | |
| Few-shot 3D Scene Segmentation | ScanNet S0 | mIoU54 | 60 | |
| Few-shot 3D Scene Segmentation | ScanNet S1 | mIoU50.32 | 60 | |
| 3D Semantic Segmentation | S3DIS (S0, S1) | mIoU (S0)61.67 | 40 | |
| Few-shot 3D Point Cloud Semantic Segmentation | S3DIS v1.2 (Area 5) | mIoU46.71 | 40 | |
| 3D Semantic Segmentation | ScanNet S0 | mIoU54 | 36 | |
| 3D Point Cloud Semantic Segmentation | ScanNet official (fold S1) | mIoU37.15 | 24 | |
| 3D Point Cloud Semantic Segmentation | ScanNet Mean Fold official | mIoU38.12 | 24 | |
| Few-shot 3D Point Cloud Semantic Segmentation | ScanNet V2 | mIoU (S0)39.09 | 24 | |
| Few-shot 3D Point Cloud Semantic Segmentation | S3DIS (Mean across folds) | mIoU44.71 | 20 |