Asynchronous Perception Machine For Efficient Test-Time-Training
About
In this work, we propose Asynchronous Perception Machine (APM), a computationally-efficient architecture for test-time-training (TTT). APM can process patches of an image one at a time in any order asymmetrically and still encode semantic-awareness in the net. We demonstrate APM's ability to recognize out-of-distribution images without dataset-specific pre-training, augmentation or any-pretext task. APM offers competitive performance over existing TTT approaches. To perform TTT, APM just distills test sample's representation once. APM possesses a unique property: it can learn using just this single representation and starts predicting semantically-aware features. APM demostrates potential applications beyond test-time-training: APM can scale up to a dataset of 2D images and yield semantic-clusterings in a single forward pass. APM also provides first empirical evidence towards validating GLOM's insight, i.e. input percept is a field. Therefore, APM helps us converge towards an implementation which can do both interpolation and perception on a shared-connectionist hardware. Our code is publicly available at this link: https://rajatmodi62.github.io/apm_project_page/.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Image Classification | ImageNet-1k (val) | -- | 1453 | |
| Image Classification | ImageNet A | Top-1 Acc84.2 | 553 | |
| Image Classification | ImageNet V2 | Top-1 Acc83.9 | 487 | |
| Image Classification | ImageNet-R | Top-1 Acc94.9 | 474 | |
| Image Classification | ImageNet-Sketch | Top-1 Accuracy77.1 | 360 | |
| Image Classification | CIFAR-10C Severity Level 5 (test) | Average Error Rate (Severity 5)14.8 | 62 | |
| Fine grained classification | Aircraft | Top-1 Acc29.7 | 62 | |
| Image Classification | ImageNet-C level 5 | Avg Top-1 Acc (ImageNet-C L5)50.3 | 61 | |
| Image Classification | ImageNet-C level 3 (test) | Acc (Brightness)80.5 | 34 | |
| Fine grained classification | Food101 | -- | 30 |