Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models

About

The popularity of pre-trained large models has revolutionized downstream tasks across diverse fields, such as language, vision, and multi-modality. To minimize the adaption cost for downstream tasks, many Parameter-Efficient Fine-Tuning (PEFT) techniques are proposed for language and 2D image pre-trained models. However, the specialized PEFT method for 3D pre-trained models is still under-explored. To this end, we introduce Point-PEFT, a novel framework for adapting point cloud pre-trained models with minimal learnable parameters. Specifically, for a pre-trained 3D model, we freeze most of its parameters, and only tune the newly added PEFT modules on downstream tasks, which consist of a Point-prior Prompt and a Geometry-aware Adapter. The Point-prior Prompt adopts a set of learnable prompt tokens, for which we propose to construct a memory bank with domain-specific knowledge, and utilize a parameter-free attention to enhance the prompt tokens. The Geometry-aware Adapter aims to aggregate point cloud features within spatial neighborhoods to capture fine-grained geometric information through local interactions. Extensive experiments indicate that our Point-PEFT can achieve better performance than the full fine-tuning on various downstream tasks, while using only 5% of the trainable parameters, demonstrating the efficiency and effectiveness of our approach. Code is released at https://github.com/Ivan-Tang-3D/Point-PEFT.

Yiwen Tang, Ray Zhang, Zoey Guo, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li• 2023

Related benchmarks

TaskDatasetResultRank
Part SegmentationShapeNetPart (test)
mIoU (Inst.)85.2
347
Object ClassificationScanObjectNN OBJ_BG
Accuracy96.39
248
Part SegmentationShapeNetPart
mIoU (Instance)85.3
246
Point Cloud ClassificationModelNet40 (test)
Accuracy93.4
229
Object ClassificationScanObjectNN PB_T50_RS
Accuracy92.85
220
Object ClassificationScanObjectNN OBJ_ONLY
Overall Accuracy94.66
186
Object ClassificationModelNet40 (test)--
180
Point Cloud ClassificationScanObjectNN PB_T50_RS (test)
Overall Accuracy86.36
91
Object DetectionScanNet v2 (test)
AP@0.5041.1
70
Few-shot 3D Object Classification (5-way)ModelNet40 (test)--
57
Showing 10 of 25 rows

Other info

Follow for update