Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Point-PEFT: Parameter-Efficient Fine-Tuning for 3D Pre-trained Models

About

The popularity of pre-trained large models has revolutionized downstream tasks across diverse fields, such as language, vision, and multi-modality. To minimize the adaption cost for downstream tasks, many Parameter-Efficient Fine-Tuning (PEFT) techniques are proposed for language and 2D image pre-trained models. However, the specialized PEFT method for 3D pre-trained models is still under-explored. To this end, we introduce Point-PEFT, a novel framework for adapting point cloud pre-trained models with minimal learnable parameters. Specifically, for a pre-trained 3D model, we freeze most of its parameters, and only tune the newly added PEFT modules on downstream tasks, which consist of a Point-prior Prompt and a Geometry-aware Adapter. The Point-prior Prompt adopts a set of learnable prompt tokens, for which we propose to construct a memory bank with domain-specific knowledge, and utilize a parameter-free attention to enhance the prompt tokens. The Geometry-aware Adapter aims to aggregate point cloud features within spatial neighborhoods to capture fine-grained geometric information through local interactions. Extensive experiments indicate that our Point-PEFT can achieve better performance than the full fine-tuning on various downstream tasks, while using only 5% of the trainable parameters, demonstrating the efficiency and effectiveness of our approach. Code is released at https://github.com/Ivan-Tang-3D/Point-PEFT.

Yiwen Tang, Ray Zhang, Zoey Guo, Dong Wang, Zhigang Wang, Bin Zhao, Xuelong Li• 2023

Related benchmarks

TaskDatasetResultRank
Part SegmentationShapeNetPart (test)
mIoU (Inst.)85.2
312
Point Cloud ClassificationModelNet40 (test)
Accuracy93.4
224
Object ClassificationScanObjectNN OBJ_BG
Accuracy96.39
215
Part SegmentationShapeNetPart
mIoU (Instance)85.2
198
Object ClassificationScanObjectNN PB_T50_RS
Accuracy92.85
195
Object ClassificationModelNet40 (test)--
180
Object ClassificationScanObjectNN OBJ_ONLY
Overall Accuracy94.66
166
Point Cloud ClassificationScanObjectNN PB_T50_RS (test)
Overall Accuracy86.36
91
Object DetectionScanNet v2 (test)
AP@0.5041.1
70
Few-shot 3D Object Classification (5-way)ModelNet40 (test)--
57
Showing 10 of 21 rows

Other info

Follow for update