FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation
About
Recent deep networks that directly handle points in a point set, e.g., PointNet, have been state-of-the-art for supervised learning tasks on point clouds such as classification and segmentation. In this work, a novel end-to-end deep auto-encoder is proposed to address unsupervised learning challenges on point clouds. On the encoder side, a graph-based enhancement is enforced to promote local structures on top of PointNet. Then, a novel folding-based decoder deforms a canonical 2D grid onto the underlying 3D object surface of a point cloud, achieving low reconstruction errors even for objects with delicate structures. The proposed decoder only uses about 7% parameters of a decoder with fully-connected neural networks, yet leads to a more discriminative representation that achieves higher linear SVM classification accuracy than the benchmark. In addition, the proposed decoder structure is shown, in theory, to be a generic architecture that is able to reconstruct an arbitrary point cloud from a 2D grid. Our code is available at http://www.merl.com/research/license#FoldingNet
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| 3D Object Classification | ModelNet40 (test) | Accuracy88.4 | 302 | |
| Shape classification | ModelNet40 (test) | -- | 255 | |
| 3D Shape Classification | ModelNet40 (test) | Accuracy88.4 | 227 | |
| Object Classification | ModelNet40 (test) | Accuracy88.4 | 180 | |
| Classification | ModelNet40 (test) | Accuracy88.4 | 99 | |
| Few-shot classification | ModelNet40 (test) | Mean Accuracy35.8 | 68 | |
| 3D shape recognition | ModelNet10 (test) | Accuracy94.4 | 64 | |
| Point Cloud Completion | PCN (test) | Watercraft14.99 | 60 | |
| 3D Object Classification | ModelNet10 (test) | Mean Class Accuracy94.4 | 57 | |
| Few-shot 3D Object Classification (5-way) | ModelNet40 (test) | 10-shot Accuracy33.4 | 57 |