Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Point-JEPA: A Joint Embedding Predictive Architecture for Self-Supervised Learning on Point Cloud

About

Recent advancements in self-supervised learning in the point cloud domain have demonstrated significant potential. However, these methods often suffer from drawbacks, including lengthy pre-training time, the necessity of reconstruction in the input space, or the necessity of additional modalities. In order to address these issues, we introduce Point-JEPA, a joint embedding predictive architecture designed specifically for point cloud data. To this end, we introduce a sequencer that orders point cloud patch embeddings to efficiently compute and utilize their proximity based on the indices during target and context selection. The sequencer also allows shared computations of the patch embeddings' proximity between context and target selection, further improving the efficiency. Experimentally, our method achieves competitive results with state-of-the-art methods while avoiding the reconstruction in the input space or additional modality.

Ayumu Saito, Prachi Kudeshia, Jiju Poovvancheri• 2024

Related benchmarks

TaskDatasetResultRank
3D Point Cloud ClassificationModelNet40 (test)--
297
Object ClassificationScanObjectNN OBJ_BG
Accuracy93.2
215
Object ClassificationScanObjectNN PB_T50_RS
Accuracy87.6
195
Object ClassificationScanObjectNN OBJ_ONLY
Overall Accuracy91.9
166
ClassificationModelNet40 (test)--
99
Object ClassificationScanObjectNN--
29
3D Object ClassificationModelNet 1k points 40 (test)
Accuracy93.8
15
Part SegmentationShapeNetPart 6 (test)
mIoU (Class)83.9
7
Showing 8 of 8 rows

Other info

Code

Follow for update