Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mantis: Mamba-native Tuning is Efficient for 3D Point Cloud Foundation Models

About

Pre-trained 3D point cloud foundation models (PFMs) have demonstrated strong transferability across diverse downstream tasks. However, full fine-tuning these models is computationally expensive and storage-intensive. Parameter-efficient fine-tuning (PEFT) offers a promising alternative, but existing PEFT approaches are primarily designed for Transformer-based backbones and rely on token-level prompting or feature transformation. Mamba-based backbones introduce a granularity mismatch between token-level adaptation and state-level sequence dynamics. Consequently, straightforward transfer of existing PEFT approaches to frozen Mamba backbones leads to substantial accuracy degradation and unstable optimization. To address this issue, we propose Mantis, the first Mamba-native PEFT framework for 3D PFMs. Specifically, a State-Aware Adapter (SAA) is introduced to inject lightweight task-conditioned control signals into selective state-space updates, enabling state-level adaptation while keeping the pre-trained backbone frozen. Moreover, different valid point cloud serializations are regularized by Dual-Serialization Consistency Distillation (DSCD), thereby reducing serialization-induced instability. Extensive experiments across multiple benchmarks demonstrate that our Mantis achieves competitive performance with only about 5% trainable parameters. Our code is available at https://github.com/gzhhhhhhh/Mantis.

Zihao Guo, Jihua Zhu, Jian Liu, Ajmal Saeed Mian• 2026

Related benchmarks

TaskDatasetResultRank
Semantic segmentationS3DIS (Area 5)
mIOU57.1
1006
Part SegmentationShapeNetPart (test)
mIoU (Inst.)86.1
347
Few-shot classificationModelNet40 10-way 20-shot
Accuracy96.4
117
Few-shot classificationModelNet40 10-way 10-shot
Accuracy93.4
117
Few-shot classificationModelNet40 5-way 20-shot
Accuracy99
102
Few-shot classificationModelNet40 5-way 10-shot
Accuracy97.2
102
Object ClassificationModelNet40
Overall Accuracy94.7
67
Object ClassificationScanObjectNN PB T50 RS
Overall Accuracy93.48
35
Object ClassificationScanObjectNN OBJ_BG variant
Overall Accuracy94.65
26
Object ClassificationScanObjectNN OBJ_ONLY variant
Overall Accuracy93.27
26
Showing 10 of 10 rows

Other info

Follow for update