MemWeaver: Weaving Hybrid Memories for Traceable Long-Horizon Agentic Reasoning
About
Large language model-based agents operating in long-horizon interactions require memory systems that support temporal consistency, multi-hop reasoning, and evidence-grounded reuse across sessions. Existing approaches largely rely on unstructured retrieval or coarse abstractions, which often lead to temporal conflicts, brittle reasoning, and limited traceability. We propose MemWeaver, a unified memory framework that consolidates long-term agent experiences into three interconnected components: a temporally grounded graph memory for structured relational reasoning, an experience memory that abstracts recurring interaction patterns from repeated observations, and a passage memory that preserves original textual evidence. MemWeaver employs a dual-channel retrieval strategy that jointly retrieves structured knowledge and supporting evidence to construct compact yet information-dense contexts for reasoning. Experiments on the LoCoMo benchmark demonstrate that MemWeaver substantially improves multi-hop and temporal reasoning accuracy while reducing input context length by over 95\% compared to long-context baselines.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Conversational Question Answering | LoCoMo Multi-Hop | F1 Score26 | 20 | |
| Conversational Question Answering | LoCoMo Temporal | F1 Score50.83 | 20 | |
| Conversational Question Answering | LoCoMo Open-Domain | F120.73 | 20 | |
| Conversational Question Answering | LoCoMo Overall | Avg Rank (F1)1 | 20 | |
| Long-horizon Question Answering | LoCoMo Multi-Hop 1.0 | EM426 | 20 | |
| Long-horizon Question Answering | LoCoMo Temporal 1.0 | EM1.12e+3 | 20 | |
| Long-horizon Question Answering | LoCoMo Open-Domain 1.0 | EM7.29 | 20 | |
| Long-horizon Question Answering | LoCoMo Single-Hop 1.0 | EM16.77 | 20 | |
| Long-horizon Question Answering | Locomo | Multi-Hop RGE-L0.2568 | 20 | |
| Conversational Question Answering | LoCoMo Single-Hop | F1 Score39.2 | 20 |