Transductive Zero-Shot Learning for 3D Point Cloud Classification
About
Zero-shot learning, the task of learning to recognize new classes not seen during training, has received considerable attention in the case of 2D image classification. However despite the increasing ubiquity of 3D sensors, the corresponding 3D point cloud classification problem has not been meaningfully explored and introduces new challenges. This paper extends, for the first time, transductive Zero-Shot Learning (ZSL) and Generalized Zero-Shot Learning (GZSL) approaches to the domain of 3D point cloud classification. To this end, a novel triplet loss is developed that takes advantage of unlabeled test data. While designed for the task of 3D point cloud classification, the method is also shown to be applicable to the more common use-case of 2D image classification. An extensive set of experiments is carried out, establishing state-of-the-art for ZSL and GZSL in the 3D point cloud domain, as well as demonstrating the applicability of the approach to the image domain.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Classification | ModelNet40 | -- | 62 | |
| 3D Semantic Segmentation | ScanNet B12 N7 | hIoU380 | 20 | |
| 3D Semantic Segmentation | ScanNet B10/N9 | hIoU7.8 | 20 | |
| 3D Semantic Segmentation | S3DIS (B8/N4) | hIoU840 | 19 | |
| 3D Semantic Segmentation | S3DIS B6 N6 | hIoU3.5 | 19 | |
| 3D Semantic Segmentation | ScanNet B15 N4 | hIoU10.5 | 13 | |
| 3D Object Classification | McGill | Accuracy13 | 11 | |
| 3D Object Classification | SHREC 2015 | Accuracy0.052 | 11 | |
| 3D Semantic Segmentation | ScanNet B15 N4 v2 | hIoU10.5 | 7 | |
| Open Vocabulary Semantic Segmentation | ScanNet V2 (N4) | mIoU6.1 | 7 |