Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Mitigating Provenance-Role Collapse in Long-Term Agents via Typed Memory Representation

About

Long-term memory is essential for persistent LLM agents, yet prevailing architectures store historical interactions as unstructured, flat text. This unconstrained storage induces provenance-role collapse, a critical failure mode where agents suffer from source-monitoring errors. To resolve this cognitive vulnerability at the architectural level, we propose MemIR, a typed Memory Intermediate Representation that operationalizes source monitoring as a structural constraint. MemIR writes long-term memory into grounded atoms that separate raw evidence, retrieval cues, and truth-bearing claims, with factual authorization restricted to supported claim atoms. It then applies multi-route atomic projection and provenance-scoped utilization to transform heterogeneous retrieval hits into claim-centered candidate bundles and a normalized fact interface for answer generation. Experiments on LoCoMo and BEAM-100K demonstrate that MemIR consistently outperforms existing memory baselines, especially on tasks requiring source tracking, temporal grounding, and aggregation of fragmented evidence.

Zhengda Jin, Bingbing Wang, Jing Li, Ruifeng Xu, Min Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringLocomo
F142.9
125
Open-domain Question AnsweringLocomo
F10.342
111
Single-hop Question AnsweringLocomo
F10.622
111
Temporal Question AnsweringLocomo
F10.635
85
Memory Capability EvaluationBEAM 100K tokens
Abstention Rate37.5
18
Showing 5 of 5 rows

Other info

Follow for update