FoldingNet: Point Cloud Auto-encoder via Deep Grid Deformation

About

Recent deep networks that directly handle points in a point set, e.g., PointNet, have been state-of-the-art for supervised learning tasks on point clouds such as classification and segmentation. In this work, a novel end-to-end deep auto-encoder is proposed to address unsupervised learning challenges on point clouds. On the encoder side, a graph-based enhancement is enforced to promote local structures on top of PointNet. Then, a novel folding-based decoder deforms a canonical 2D grid onto the underlying 3D object surface of a point cloud, achieving low reconstruction errors even for objects with delicate structures. The proposed decoder only uses about 7% parameters of a decoder with fully-connected neural networks, yet leads to a more discriminative representation that achieves higher linear SVM classification accuracy than the benchmark. In addition, the proposed decoder structure is shown, in theory, to be a generic architecture that is able to reconstruct an arbitrary point cloud from a 2D grid. Our code is available at http://www.merl.com/research/license#FoldingNet

Yaoqing Yang, Chen Feng, Yiru Shen, Dong Tian• 2017

Related benchmarks

Task	Dataset	Result
3D Object Classification	ModelNet40 (test)	Accuracy88.4	321
Shape classification	ModelNet40 (test)	--	255
3D Shape Classification	ModelNet40 (test)	Accuracy88.4	227
Object Classification	ModelNet40 (test)	Accuracy88.4	180
Classification	ModelNet40 (test)	Accuracy88.4	120
Few-shot classification	ModelNet40 10-way 10-shot	Accuracy18.6	117
Few-shot classification	ModelNet40 10-way 20-shot	Accuracy15.4	117
Few-shot classification	ModelNet40 5-way 10-shot	Accuracy33.4	102
Few-shot classification	ModelNet40 5-way 20-shot	Accuracy35.8	102
Point Cloud Completion	PCN (test)	Watercraft51.4	88

Showing 10 of 70 rows

Other info

Code

Follow for update

@wizwand_team Discord