Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Control Architecture for Training-Free Memory Use

About

Prompt-injected memory can improve reasoning without updating model weights, but it also creates a control problem: retrieved content helps only when it is applied in the right state. We study this problem in a strict training-free setting and formulate it as applicability control: when to trigger a memory-assisted second pass, when to trust it, and how to maintain the memory bank over time. Our method combines uncertainty-based routing, confidence-based selective acceptance, bank selection across rule and exemplar memory, and evidence-based governance of the memory bank over time. Under a locked training-free protocol with compute-matched controls, it improves two core arithmetic benchmarks by +7.0 points on SVAMP and +7.67 points on ASDiv over baseline. The same architecture also transfers to QA and agent benchmarks with smaller positive effects and shows the same positive direction on a second checkpoint for the main arithmetic tasks. On arithmetic, the main empirical pattern is that the control architecture, rather than raw memory exposure, drives the improvements on SVAMP and ASDiv. Mechanistically, confidence separates helpful from harmful rule-bank interventions, and under fixed retrieval the repair-versus-corrupt difference localizes to rows whose retrieved set actually contains the edited entries.

Yanzhen Lu, Muchen Jiang, Zhicheng Qian, Xingyu Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Mathematical Problem SolvingAIME 2024
Accuracy76.67
113
Arithmetic ReasoningASDIV
Accuracy85.2
62
Question AnsweringOpenBookQA
Accuracy51.8
4
Question AnsweringARC Easy
Accuracy0.725
4
Question AnsweringARC Challenge
Accuracy27.8
4
Scientific ReasoningScienceWorld
Delta Accuracy0.0017
1
Showing 6 of 6 rows

Other info

Follow for update