Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Parameter-efficient Prompt Learning for 3D Point Cloud Understanding

About

This paper presents a parameter-efficient prompt tuning method, named PPT, to adapt a large multi-modal model for 3D point cloud understanding. Existing strategies are quite expensive in computation and storage, and depend on time-consuming prompt engineering. We address the problems from three aspects. Firstly, a PromptLearner module is devised to replace hand-crafted prompts with learnable contexts to automate the prompt tuning process. Then, we lock the pre-trained backbone instead of adopting the full fine-tuning paradigm to substantially improve the parameter efficiency. Finally, a lightweight PointAdapter module is arranged near target tasks to enhance prompt tuning for 3D point cloud understanding. Comprehensive experiments are conducted to demonstrate the superior parameter and data efficiency of the proposed method.Meanwhile, we obtain new records on 4 public datasets and multiple 3D tasks, i.e., point cloud recognition, few-shot learning, and part segmentation. The implementation is available at https://github.com/auniquesun/PPT.

Hongyu Sun, Yongcai Wang, Wang Chen, Haoran Deng, Deying Li• 2024

Related benchmarks

TaskDatasetResultRank
Object ClassificationScanObjectNN OBJ_BG
Accuracy40.3
248
3D Object ClassificationModelNet40--
89
3D Object ClassificationScanObjectNN OBJ_ONLY
Overall Accuracy39.8
83
3D ClassificationObjaverse LVIS
Top-1 Acc41
61
3D ClassificationScanObjectNN OBJ-BG official
Accuracy95.4
37
3D Object ClassificationScanObjectNN PB
Accuracy89.1
29
3D Object ClassificationScanObjectNN ONLY
Accuracy93.1
23
3D Object ClassificationTarget Dataset Aggregate MN40, ONLY, BG, PB
Accuracy41.9
5
Showing 8 of 8 rows

Other info

Follow for update