Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions
About
Achieving robust 3D perception in the face of corrupted data presents an challenging hurdle within 3D vision research. Contemporary transformer-based point cloud recognition models, albeit advanced, tend to overfit to specific patterns, consequently undermining their robustness against corruption. In this work, we introduce the Target-Guided Adversarial Point Cloud Transformer, termed APCT, a novel architecture designed to augment global structure capture through an adversarial feature erasing mechanism predicated on patterns discerned at each step during training. Specifically, APCT integrates an Adversarial Significance Identifier and a Target-guided Promptor. The Adversarial Significance Identifier, is tasked with discerning token significance by integrating global contextual analysis, utilizing a structural salience index algorithm alongside an auxiliary supervisory mechanism. The Target-guided Promptor, is responsible for accentuating the propensity for token discard within the self-attention mechanism, utilizing the value derived above, consequently directing the model attention towards alternative segments in subsequent stages. By iteratively applying this strategy in multiple steps during training, the network progressively identifies and integrates an expanded array of object-associated patterns. Extensive experiments demonstrate that our method achieves state-of-the-art results on multiple corruption benchmarks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Point Cloud Classification | ScanObjectNN PB_T50_RS (test) | Overall Accuracy86.2 | 91 | |
| Point Cloud Classification | ModelNet-C (test) | mCE72.2 | 58 | |
| Classification | ScanObjectNN-C | RmCE74.2 | 24 | |
| Segmentation | ShapeNet-C (test) | RmCE (Robustness Metric)55.3 | 20 | |
| Classification | ModelNet40-C (test) | RmCE Error51.4 | 11 | |
| Part Segmentation | ShapeNet-C | mCE81.4 | 10 | |
| Classification | ScanObjectNN-C 1.0 (test) | mOA72.4 | 8 | |
| Point Cloud Classification | MVImageNet (test) | Overall Accuracy86.6 | 6 |