TRUST: Test-Time Refinement using Uncertainty-Guided SSM Traverses
About
State Space Models (SSMs) have emerged as efficient alternatives to Vision Transformers (ViTs), with VMamba standing out as a pioneering architecture designed for vision tasks. However, their generalization performance degrades significantly under distribution shifts. To address this limitation, we propose TRUST (Test-Time Refinement using Uncertainty-Guided SSM Traverses), a novel test-time adaptation (TTA) method that leverages diverse traversal permutations to generate multiple causal perspectives of the input image. Model predictions serve as pseudo-labels to guide updates of the Mamba-specific parameters, and the adapted weights are averaged to integrate the learned information across traversal scans. Altogether, TRUST is the first approach that explicitly leverages the unique architectural properties of SSMs for adaptation. Experiments on seven benchmarks show that TRUST consistently improves robustness and outperforms existing TTA methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | PACS (test) | Average Accuracy69.9 | 279 | |
| Image Classification | ImageNet V2 (test) | Top-1 Accuracy64 | 232 | |
| Image Classification | CIFAR-10-C | Accuracy77.5 | 179 | |
| Image Classification | ImageNet-R (test) | -- | 170 | |
| Image Classification | ImageNet-C (val) | mCE53.4 | 105 | |
| Image Classification | ImageNet-S (test) | Top-1 Acc41.5 | 17 | |
| Classification | CIFAR10-C (test) | Gaussian Noise63.1 | 8 | |
| Image Classification | CIFAR100-C | Top-1 Accuracy54.3 | 8 | |
| Image Classification | ImageNet-C | Top-1 Accuracy0.561 | 8 | |
| Classification | CIFAR100-C (test) | Gaussian Noise Error32.1 | 8 |