A Control Architecture for Training-Free Memory Use

About

Prompt-injected memory can improve reasoning without updating model weights, but it also creates a control problem: retrieved content helps only when it is applied in the right state. We study this problem in a strict training-free setting and formulate it as applicability control: when to trigger a memory-assisted second pass, when to trust it, and how to maintain the memory bank over time. Our method combines uncertainty-based routing, confidence-based selective acceptance, bank selection across rule and exemplar memory, and evidence-based governance of the memory bank over time. Under a locked training-free protocol with compute-matched controls, it improves two core arithmetic benchmarks by +7.0 points on SVAMP and +7.67 points on ASDiv over baseline. The same architecture also transfers to QA and agent benchmarks with smaller positive effects and shows the same positive direction on a second checkpoint for the main arithmetic tasks. On arithmetic, the main empirical pattern is that the control architecture, rather than raw memory exposure, drives the improvements on SVAMP and ASDiv. Mechanistically, confidence separates helpful from harmful rule-bank interventions, and under fixed retrieval the repair-versus-corrupt difference localizes to rows whose retrieved set actually contains the edited entries.

Yanzhen Lu, Muchen Jiang, Zhicheng Qian, Xingyu Zhou• 2026

Related benchmarks

Task	Dataset	Result
Mathematical Problem Solving	AIME 2024	Accuracy76.67	113
Arithmetic Reasoning	ASDIV	Accuracy85.2	69
Question Answering	OpenBookQA	Accuracy51.8	4
Question Answering	ARC Easy	Accuracy0.725	4
Question Answering	ARC Challenge	Accuracy27.8	4
Scientific Reasoning	ScienceWorld	Delta Accuracy0.0017	1

Showing 6 of 6 rows

Other info

Follow for update

@wizwand_team Discord