Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Weight-Space Linear Recurrent Neural Networks

About

We introduce WARP (Weight-space Adaptive Recurrent Prediction), a simple yet powerful model that unifies weight-space learning with linear recurrence to redefine sequence modeling. Unlike conventional recurrent neural networks (RNNs) which collapse temporal dynamics into fixed-dimensional hidden states, WARP explicitly parametrizes its hidden state as the weights and biases of a distinct auxiliary neural network, and uses input differences to drive its recurrence. This brain-inspired formulation enables efficient gradient-free adaptation of the auxiliary network at test-time, in-context learning abilities, and seamless integration of domain-specific physical priors. Empirical validation shows that WARP matches or surpasses state-of-the-art baselines on diverse classification tasks, featuring in the top three in 4 out of 6 real-world challenging datasets. Furthermore, extensive experiments across sequential image completion, multivariate time series forecasting, and dynamical system reconstruction demonstrate its expressiveness and generalisation capabilities. Remarkably, a physics-informed variant of our model outperforms the next best model by more than 10x. Ablation studies confirm the architectural necessity of key components, solidifying weight-space linear RNNs as a transformative paradigm for adaptive machine intelligence.

Roussel Desmond Nzoyem, Nawid Keshtmand, Enrique Crespo Fernandez, Idriss Tsayem, Raul Santos-Rodriguez, David A.W. Barton, Tom Deakin• 2025

Related benchmarks

TaskDatasetResultRank
Traffic ForecastingPeMS08
RMSE10.1
242
Multivariate Time Series ClassificationSelfRegSCP1
Accuracy83.53
25
Image ModelingMNIST (test)
Bits/dim0.416
24
Multivariate Time Series ClassificationHeartbeat UEA (test)
Accuracy80.65
22
Multivariate Time Series ClassificationSelfRegulationSCP2 UEA (test)
Accuracy57.89
22
Multivariate Time Series ClassificationMotorImagery UEA (test)
Accuracy56.14
22
Image ModelingFashionMNIST (test)
Bits/dim0.59
12
Image CompletionCelebA (test)
MSE0.027
12
Multivariate Time Series ClassificationEthanolConcentration UEA (test)
Accuracy36.49
11
Time-series classificationEigenWorms UEA (test)
Accuracy70.93
11
Showing 10 of 10 rows

Other info

Follow for update