Asynchronous Perception Machine For Efficient Test-Time-Training

About

In this work, we propose Asynchronous Perception Machine (APM), a computationally-efficient architecture for test-time-training (TTT). APM can process patches of an image one at a time in any order asymmetrically and still encode semantic-awareness in the net. We demonstrate APM's ability to recognize out-of-distribution images without dataset-specific pre-training, augmentation or any-pretext task. APM offers competitive performance over existing TTT approaches. To perform TTT, APM just distills test sample's representation once. APM possesses a unique property: it can learn using just this single representation and starts predicting semantically-aware features. APM demostrates potential applications beyond test-time-training: APM can scale up to a dataset of 2D images and yield semantic-clusterings in a single forward pass. APM also provides first empirical evidence towards validating GLOM's insight, i.e. input percept is a field. Therefore, APM helps us converge towards an implementation which can do both interpolation and perception on a shared-connectionist hardware. Our code is publicly available at this link: https://rajatmodi62.github.io/apm_project_page/.

Rajat Modi, Yogesh Singh Rawat• 2024

Related benchmarks

Task	Dataset	Result
Image Classification	ImageNet-1k (val)	--	1498
Image Classification	ImageNet V2	Top-1 Acc83.9	749
Image Classification	ImageNet A	Top-1 Acc84.2	698
Image Classification	ImageNet-R	Top-1 Acc94.9	581
Image Classification	ImageNet-Sketch	Top-1 Accuracy77.1	473
Image Classification	CIFAR-10C Severity Level 5 (test)	Average Error Rate (Severity 5)14.8	136
Image Classification	ImageNet-C level 5	Avg Top-1 Acc (ImageNet-C L5)50.3	110
Fine grained classification	UCF101	Accuracy72.6	81
Fine grained classification	Aircraft	Top-1 Acc29.7	72
Fine grained classification	Caltech101	Accuracy89.6	60

Showing 10 of 21 rows

Other info

Code

Follow for update

@wizwand_team Discord