Exact Flow Linear Attention: Exact Solution from Continuous-Time Dynamics
About
In this paper, we introduce Exact Flow Linear Attention~(EFLA), an exact-flow formulation of delta-rule linear attention. We show that the delta-rule update can be interpreted as an explicit Euler discretization of an underlying continuous-time system. EFLA replaces this first-order update with the exact closed-form flow. By exploiting the rank-1 structure of the dynamics matrix, both the matrix exponential and the input integral collapse to a simple update that preserves delta-rule linear attention's algebraic structure, parameter count, linear-time complexity, and chunkwise parallelism. This attention mechanism removes the Euler discretization error of the delta-rule dynamics without introducing additional parameters. Experiments on robustness tests, language modeling benchmarks, and the MAD synthetic benchmark show that EFLA improves stability under corrupted and high-energy inputs, reduces perplexity, and achieves stronger downstream performance compared to SSM and Euler-style baselines. These results establish exact-flow integration as a principled and scalable update mechanism for delta-rule linear attention.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Commonsense Reasoning | HellaSwag | Accuracy44.5 | 1896 | |
| Commonsense Reasoning | WinoGrande | Accuracy52.1 | 1442 | |
| Commonsense Reasoning | PIQA | Accuracy68.9 | 757 | |
| Language Modeling | WikiText | PPL18.3 | 740 | |
| Language Modeling | LAMBADA | Accuracy43.2 | 412 | |
| Question Answering | BoolQ | -- | 317 | |
| Question Answering | SciQ | Accuracy84.2 | 283 | |
| Commonsense Reasoning | ARC Challenge | Accuracy26.4 | 243 | |
| Question Answering | OpenBookQA | Normalized Accuracy31.6 | 102 | |
| Common Sense Reasoning | ARC Easy | ARC (easy) Accuracy54.4 | 101 |