Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FAAST: Forward-Only Associative Learning via Closed-Form Fast Weights for Test-Time Supervised Adaptation

About

Adapting pretrained models typically involves a trade-off between the high training costs of backpropagation and the heavy inference overhead of memory-based or in-context learning. We propose FAAST, a forward-only associative adaptation method that analytically compiles labeled examples into fast weights in a single pass. By eliminating memory or context dependence, FAAST achieves constant-time inference and decouples task adaptation from pretrained representation. Across image classification and language modeling benchmarks, FAAST matches or exceeds backprop-based adaptation while reducing adaptation time by over 90% and is competitive to memory/context-based adaptation while saving memory usage by up to 95%. These results demonstrate FAAST as a highly efficient, scalable solution for supervised task adaptation, particularly for resource-constrained models. We release the code and models at https://github.com/baoguangsheng/faast.

Guangsheng Bao, Hongbo Zhang, Han Cui, Ke Sun, Yanbin Zhao, Juncai He, Yue Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Language ModelingWikiText-103
PPL13.23
216
Machine TranslationIWSLT en-de 2017 (test)
BLEU37.1
46
Image ClassificationCIFAR-10 (full)
Top-1 Acc86.7
29
Machine TranslationIWSLT Fr-En 2017 (test)
BLEU43.93
22
Machine TranslationIWSLT En-Fr 2017 (test)
BLEU37.08
11
Sentiment ClassificationSST-2 (All)
Accuracy87.5
7
Image ClassificationCIFAR10 10-way 5-shot
Accuracy73.8
6
Image ClassificationminiImageNet 20-way 5-shot
Accuracy88.6
6
Image ClassificationminiImageNet 20-way (Full)
Accuracy93
6
Sentiment ClassificationSST-2 1-shot
Accuracy78.5
2
Showing 10 of 14 rows

Other info

GitHub

Follow for update