Target-Guided Adversarial Point Cloud Transformer Towards Recognition Against Real-world Corruptions

About

Achieving robust 3D perception in the face of corrupted data presents an challenging hurdle within 3D vision research. Contemporary transformer-based point cloud recognition models, albeit advanced, tend to overfit to specific patterns, consequently undermining their robustness against corruption. In this work, we introduce the Target-Guided Adversarial Point Cloud Transformer, termed APCT, a novel architecture designed to augment global structure capture through an adversarial feature erasing mechanism predicated on patterns discerned at each step during training. Specifically, APCT integrates an Adversarial Significance Identifier and a Target-guided Promptor. The Adversarial Significance Identifier, is tasked with discerning token significance by integrating global contextual analysis, utilizing a structural salience index algorithm alongside an auxiliary supervisory mechanism. The Target-guided Promptor, is responsible for accentuating the propensity for token discard within the self-attention mechanism, utilizing the value derived above, consequently directing the model attention towards alternative segments in subsequent stages. By iteratively applying this strategy in multiple steps during training, the network progressively identifies and integrates an expanded array of object-associated patterns. Extensive experiments demonstrate that our method achieves state-of-the-art results on multiple corruption benchmarks.

Jie Wang, Tingfa Xu, Lihe Ding, Jianan Li• 2024

Related benchmarks

Task	Dataset	Result
Point Cloud Classification	ScanObjectNN PB_T50_RS (test)	Overall Accuracy86.2	91
Point Cloud Classification	ModelNet-C (test)	mCE72.2	58
Classification	ScanObjectNN-C	RmCE74.2	24
Segmentation	ShapeNet-C (test)	RmCE (Robustness Metric)55.3	20
Classification	ModelNet40-C (test)	RmCE Error51.4	11
Part Segmentation	ShapeNet-C	mCE81.4	10
Classification	ScanObjectNN-C 1.0 (test)	mOA72.4	8
Point Cloud Classification	MVImageNet (test)	Overall Accuracy86.6	6

Showing 8 of 8 rows

Other info

Code

Follow for update

@wizwand_team Discord