Exact Flow Linear Attention: Exact Solution from Continuous-Time Dynamics

About

In this paper, we introduce Exact Flow Linear Attention (EFLA), an exact-flow formulation of delta-rule linear attention. We show that the delta-rule update can be interpreted as an explicit Euler discretization of an underlying continuous-time system. EFLA replaces this first-order update with the exact closed-form flow. By exploiting the rank-1 structure of the dynamics matrix, both the matrix exponential and the input integral collapse to a simple update that preserves delta-rule linear attention's algebraic structure, parameter count, linear-time complexity, and chunkwise parallelism. This attention mechanism removes the Euler discretization error of the delta-rule dynamics without introducing additional parameters. Experiments on robustness tests, language modeling benchmarks, and the MAD synthetic benchmark show that EFLA improves stability under corrupted and high-energy inputs, reduces perplexity, and achieves stronger downstream performance compared to SSM and Euler-style baselines. These results establish exact-flow integration as a principled and scalable update mechanism for delta-rule linear attention.
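The abstract gives no equations, so the following is only a minimal sketch of the idea under stated assumptions, not the authors' implementation. It assumes the state S (d_k x d_v) follows dS/dt = -k k^T S + k v^T over a step of length beta, so that one explicit Euler step reproduces the familiar delta rule S <- S - beta k (k^T S - v^T). Because k k^T is rank-1 with single nonzero eigenvalue lam = ||k||^2, the exact solution of that assumed system has the same form with beta replaced by alpha = (1 - exp(-beta lam)) / lam. The function names (delta_step, euler_step, exact_step, retrieve) are hypothetical.

```python
import math

def delta_step(S, k, v, alpha):
    """Return S - alpha * k (k^T S - v^T) for a d_k x d_v state stored as a list of rows."""
    d_k, d_v = len(k), len(v)
    # Value currently retrieved by key k: pred = k^T S.
    pred = [sum(k[i] * S[i][j] for i in range(d_k)) for j in range(d_v)]
    return [[S[i][j] - alpha * k[i] * (pred[j] - v[j]) for j in range(d_v)]
            for i in range(d_k)]

def euler_step(S, k, v, beta):
    """Delta rule, read as one explicit Euler step of size beta (assumed ODE)."""
    return delta_step(S, k, v, beta)

def exact_step(S, k, v, beta):
    """Closed-form flow of the assumed ODE over the same interval."""
    lam = sum(x * x for x in k)  # the only nonzero eigenvalue of k k^T
    # alpha = (1 - exp(-beta * lam)) / lam; its limit as lam -> 0 is beta.
    alpha = beta if lam < 1e-12 else -math.expm1(-beta * lam) / lam
    return delta_step(S, k, v, alpha)

def retrieve(S, k):
    """Read out k^T S."""
    return [sum(k[i] * S[i][j] for i in range(len(k))) for j in range(len(S[0]))]

if __name__ == "__main__":
    k, v, beta = [3.0, 4.0], [1.0], 1.0  # high-energy key: ||k||^2 = 25
    S_euler = S_exact = [[0.0], [0.0]]
    for _ in range(3):
        S_euler = euler_step(S_euler, k, v, beta)
        S_exact = exact_step(S_exact, k, v, beta)
        print(retrieve(S_euler, k), retrieve(S_exact, k))
```

In this toy setting the Euler retrieval overshoots the target v and grows in magnitude at every step, because its contraction factor 1 - beta lam has magnitude larger than 1, while the exact step uses exp(-beta lam), which always lies between 0 and 1, and stays close to v. Both updates cost the same and use the same inputs; only the scalar gain differs.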

Jingdi Lei, Di Zhang, Soujanya Poria • 2025

Related benchmarks

Task	Dataset	Result
Commonsense Reasoning	HellaSwag	Accuracy 44.5	1896
Commonsense Reasoning	WinoGrande	Accuracy 52.1	1442
Commonsense Reasoning	PIQA	Accuracy 68.9	757
Language Modeling	WikiText	PPL 18.3	740
Language Modeling	LAMBADA	Accuracy 43.2	412
Question Answering	BoolQ	--	317
Question Answering	SciQ	Accuracy 84.2	283
Commonsense Reasoning	ARC Challenge	Accuracy 26.4	243
Question Answering	OpenBookQA	Normalized Accuracy 31.6	102
Common Sense Reasoning	ARC Easy	ARC (easy) Accuracy 54.4	101

Showing 10 of 11 rows
