Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Exact Flow Linear Attention: Exact Solution from Continuous-Time Dynamics

About

In this paper, we introduce Exact Flow Linear Attention~(EFLA), an exact-flow formulation of delta-rule linear attention. We show that the delta-rule update can be interpreted as an explicit Euler discretization of an underlying continuous-time system. EFLA replaces this first-order update with the exact closed-form flow. By exploiting the rank-1 structure of the dynamics matrix, both the matrix exponential and the input integral collapse to a simple update that preserves delta-rule linear attention's algebraic structure, parameter count, linear-time complexity, and chunkwise parallelism. This attention mechanism removes the Euler discretization error of the delta-rule dynamics without introducing additional parameters. Experiments on robustness tests, language modeling benchmarks, and the MAD synthetic benchmark show that EFLA improves stability under corrupted and high-energy inputs, reduces perplexity, and achieves stronger downstream performance compared to SSM and Euler-style baselines. These results establish exact-flow integration as a principled and scalable update mechanism for delta-rule linear attention.

Jingdi Lei, Di Zhang, Soujanya Poria• 2025

Related benchmarks

TaskDatasetResultRank
Commonsense ReasoningHellaSwag
Accuracy44.5
1896
Commonsense ReasoningWinoGrande
Accuracy52.1
1442
Commonsense ReasoningPIQA
Accuracy68.9
757
Language ModelingWikiText
PPL18.3
740
Language ModelingLAMBADA
Accuracy43.2
412
Question AnsweringBoolQ--
317
Question AnsweringSciQ
Accuracy84.2
283
Commonsense ReasoningARC Challenge
Accuracy26.4
243
Question AnsweringOpenBookQA
Normalized Accuracy31.6
102
Common Sense ReasoningARC Easy
ARC (easy) Accuracy54.4
101
Showing 10 of 11 rows

Other info

GitHub

Follow for update