Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud

About

Recent advancements in self-supervised learning in the point cloud domain have demonstrated significant potential. However, these methods often suffer from drawbacks, including lengthy pre-training time, the necessity of reconstruction in the input space, or the necessity of additional modalities. In order to address these issues, we introduce Point-JEPA, a joint embedding predictive architecture designed specifically for point cloud data. To this end, we introduce a sequencer that orders point cloud patch embeddings to efficiently compute and utilize their proximity based on the indices during target and context selection. The sequencer also allows shared computations of the patch embeddings' proximity between context and target selection, further improving the efficiency. Experimentally, our method achieves competitive results with state-of-the-art methods while avoiding the reconstruction in the input space or additional modality.

Ayumu Saito, Prachi Kudeshia, Jiju Poovvancheri• 2024

Related benchmarks

Task	Dataset	Result
3D Point Cloud Classification	ModelNet40 (test)	--	307
Object Classification	ScanObjectNN OBJ_BG	Accuracy93.2	248
Object Classification	ScanObjectNN PB_T50_RS	Accuracy87.6	220
Object Classification	ScanObjectNN OBJ_ONLY	Overall Accuracy91.9	186
Classification	ModelNet40 (test)	--	120
Classification	ModelNet40	Accuracy98.2	108
Point Cloud Classification	ScanObjectNN PB_T50_RS	Overall Accuracy86.05	100
Point Cloud Classification	ScanObjectNN OBJ_BG	Overall Accuracy91.84	66
Object Classification	ScanObjectNN	Accuracy (OBJ_ONLY)90.1	46
3D Point Cloud Classification	MN40	Accuracy93.02	21

Showing 10 of 14 rows

Other info

Code

Follow for update

@wizwand_team Discord