Unsupervised Point Cloud Pre-Training via Occlusion Completion

About

We describe a simple pre-training approach for point clouds. It works in three steps: 1. Mask all points occluded in a camera view; 2. Learn an encoder-decoder model to reconstruct the occluded points; 3. Use the encoder weights as initialisation for downstream point cloud tasks. We find that even when we construct a single pre-training dataset (from ModelNet40), this pre-training method improves accuracy across different datasets and encoders, on a wide range of downstream tasks. Specifically, we show that our method outperforms previous pre-training methods in object classification, and both part-based and semantic segmentation tasks. We study the pre-trained features and find that they lead to wide downstream minima, have high transformation invariance, and have activations that are highly correlated with part labels. Code and data are available at: https://github.com/hansen7/OcCo

Hanchen Wang, Qi Liu, Xiangyu Yue, Joan Lasenby, Matthew J. Kusner• 2020

Related benchmarks

Task	Dataset	Result
Semantic segmentation	S3DIS (Area 5)	mIOU55.4	1029
Part Segmentation	ShapeNetPart (test)	mIoU (Inst.)85.1	358
Semantic segmentation	S3DIS (6-fold)	mIoU (Mean IoU)58.5	344
3D Point Cloud Classification	ModelNet40 (test)	OA92.2	307
Shape classification	ModelNet40 (test)	OA92.9	255
Part Segmentation	ShapeNetPart	mIoU (Instance)85.1	254
Object Classification	ScanObjectNN OBJ_BG	Accuracy88.2	248
Point Cloud Classification	ModelNet40 (test)	Accuracy93	229
Object Classification	ScanObjectNN PB_T50_RS	Accuracy78.79	220
Object Classification	ScanObjectNN OBJ_ONLY	Overall Accuracy85.54	186

Showing 10 of 70 rows

Other info

Follow for update

@wizwand_team Discord