ZigzagPointMamba: Spatial-Semantic Mamba for Point Cloud Understanding

About

State Space models (SSMs) such as PointMamba enable efficient feature extraction for point cloud self-supervised learning with linear complexity, outperforming Transformers in computational efficiency. However, existing PointMamba-based methods depend on complex token ordering and random masking, which disrupt spatial continuity and local semantic correlations. We propose ZigzagPointMamba to tackle these challenges. The core of our approach is a simple zigzag scan path that globally sequences point cloud tokens, enhancing spatial continuity by preserving the proximity of spatially adjacent point tokens. Nevertheless, random masking undermines local semantic modeling in self-supervised learning. To address this, we introduce a Semantic-Siamese Masking Strategy (SMS), which masks semantically similar tokens to facilitate reconstruction by integrating local features of original and similar tokens. This overcomes the dependence on isolated local features and enables robust global semantic modeling. Our pre-trained ZigzagPointMamba weights significantly improve downstream tasks, achieving a 1.59% mIoU gain on ShapeNetPart for part segmentation, a 0.4% higher accuracy on ModelNet40 for classification, and 0.19%, 1.22%, and 0.72% higher accuracies respectively for the classification tasks on the OBJ-BG, OBJ-ONLY, and PB-T50-RS subsets of ScanObjectNN.

Linshuang Diao, Sensen Song, Yurong Qian, Dayong Ren• 2025

Related benchmarks

Task	Dataset	Result
Part Segmentation	ShapeNetPart (test)	mIoU (Inst.)85.8	358
Few-shot classification	ModelNet40 10-way 20-shot	Accuracy94.2	117
Few-shot classification	ModelNet40 10-way 10-shot	Accuracy90	117
Few-shot classification	ModelNet40 5-way 20-shot	Accuracy99	102
Few-shot classification	ModelNet40 5-way 10-shot	Accuracy96	102
Object Classification	ModelNet40	Overall Accuracy93.2	78
Object Classification	ScanObjectNN PB T50 RS	Overall Accuracy88.65	47
Object Classification	ScanObjectNN OBJ_BG variant	Overall Accuracy94.15	38
Object Classification	ScanObjectNN OBJ_ONLY variant	Overall Accuracy92.1	38

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord