Mantis: Mamba-native Tuning is Efficient for 3D Point Cloud Foundation Models
About
Pre-trained 3D point cloud foundation models (PFMs) have demonstrated strong transferability across diverse downstream tasks. However, full fine-tuning these models is computationally expensive and storage-intensive. Parameter-efficient fine-tuning (PEFT) offers a promising alternative, but existing PEFT approaches are primarily designed for Transformer-based backbones and rely on token-level prompting or feature transformation. Mamba-based backbones introduce a granularity mismatch between token-level adaptation and state-level sequence dynamics. Consequently, straightforward transfer of existing PEFT approaches to frozen Mamba backbones leads to substantial accuracy degradation and unstable optimization. To address this issue, we propose Mantis, the first Mamba-native PEFT framework for 3D PFMs. Specifically, a State-Aware Adapter (SAA) is introduced to inject lightweight task-conditioned control signals into selective state-space updates, enabling state-level adaptation while keeping the pre-trained backbone frozen. Moreover, different valid point cloud serializations are regularized by Dual-Serialization Consistency Distillation (DSCD), thereby reducing serialization-induced instability. Extensive experiments across multiple benchmarks demonstrate that our Mantis achieves competitive performance with only about 5% trainable parameters. Our code is available at https://github.com/gzhhhhhhh/Mantis.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic segmentation | S3DIS (Area 5) | mIOU57.1 | 1006 | |
| Part Segmentation | ShapeNetPart (test) | mIoU (Inst.)86.1 | 347 | |
| Few-shot classification | ModelNet40 10-way 20-shot | Accuracy96.4 | 117 | |
| Few-shot classification | ModelNet40 10-way 10-shot | Accuracy93.4 | 117 | |
| Few-shot classification | ModelNet40 5-way 20-shot | Accuracy99 | 102 | |
| Few-shot classification | ModelNet40 5-way 10-shot | Accuracy97.2 | 102 | |
| Object Classification | ModelNet40 | Overall Accuracy94.7 | 67 | |
| Object Classification | ScanObjectNN PB T50 RS | Overall Accuracy93.48 | 35 | |
| Object Classification | ScanObjectNN OBJ_BG variant | Overall Accuracy94.65 | 26 | |
| Object Classification | ScanObjectNN OBJ_ONLY variant | Overall Accuracy93.27 | 26 |