LaserMix for Semi-Supervised LiDAR Semantic Segmentation
About
Densely annotating LiDAR point clouds is costly, which restrains the scalability of fully-supervised learning methods. In this work, we study the underexplored semi-supervised learning (SSL) in LiDAR segmentation. Our core idea is to leverage the strong spatial cues of LiDAR point clouds to better exploit unlabeled data. We propose LaserMix to mix laser beams from different LiDAR scans, and then encourage the model to make consistent and confident predictions before and after mixing. Our framework has three appealing properties: 1) Generic: LaserMix is agnostic to LiDAR representations (e.g., range view and voxel), and hence our SSL framework can be universally applied. 2) Statistically grounded: We provide a detailed analysis to theoretically explain the applicability of the proposed framework. 3) Effective: Comprehensive experimental analysis on popular LiDAR segmentation datasets (nuScenes, SemanticKITTI, and ScribbleKITTI) demonstrates our effectiveness and superiority. Notably, we achieve competitive results over fully-supervised counterparts with 2x to 5x fewer labels and improve the supervised-only baseline significantly by 10.8% on average. We hope this concise yet high-performing framework could facilitate future research in semi-supervised LiDAR segmentation. Code is publicly available.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| LiDAR Semantic Segmentation | nuScenes (val) | mIoU75.2 | 169 | |
| LiDAR Semantic Segmentation | SemanticKITTI (val) | mIoU62.4 | 87 | |
| Semantic segmentation | SemanticKITTI v1.0 (test) | mIoU62.3 | 71 | |
| 3D Semantic Segmentation | SemanticKITTI (val) | mIoU62.3 | 54 | |
| Semantic segmentation | NuScenes v1.0 (test) | mIoU73.2 | 44 | |
| Semantic segmentation | KITTI-360 (val) | mIoU62.3 | 36 | |
| LiDAR Semantic Segmentation | ScribbleKITTI (train) | mIoU56.8 | 34 | |
| LiDAR Semantic Segmentation | SemanticKITTI (train) | mIoU62.3 | 30 | |
| 3D Point Cloud Semantic Segmentation | SemanticSTF SemanticKITTI (val) | mIoU14.7 | 23 | |
| Semantic segmentation | semanticKITTI SynLiDAR source (val) | mIoU (Mean IoU)36 | 21 |