
MemWeaver: Weaving Hybrid Memories for Traceable Long-Horizon Agentic Reasoning

About

Large language model-based agents operating in long-horizon interactions require memory systems that support temporal consistency, multi-hop reasoning, and evidence-grounded reuse across sessions. Existing approaches largely rely on unstructured retrieval or coarse abstractions, which often lead to temporal conflicts, brittle reasoning, and limited traceability. We propose MemWeaver, a unified memory framework that consolidates long-term agent experiences into three interconnected components: a temporally grounded graph memory for structured relational reasoning, an experience memory that abstracts recurring interaction patterns from repeated observations, and a passage memory that preserves original textual evidence. MemWeaver employs a dual-channel retrieval strategy that jointly retrieves structured knowledge and supporting evidence to construct compact yet information-dense contexts for reasoning. Experiments on the LoCoMo benchmark demonstrate that MemWeaver substantially improves multi-hop and temporal reasoning accuracy while reducing input context length by over 95% compared to long-context baselines.
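
To make the three-component layout and the dual-channel retrieval idea more concrete, here is a minimal Python sketch. It is not the authors' implementation: the names `Fact`, `HybridMemory`, `consolidate`, `retrieve`, and `build_context` are hypothetical, the keyword-overlap scoring is a stand-in for whatever retriever the paper actually uses, and the pattern abstraction is reduced to simple counting.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Fact:
    """One edge of the (hypothetical) graph memory."""
    subject: str
    relation: str
    obj: str
    timestamp: int   # assumption: a session index used for temporal ordering
    passage_id: int  # link back to the passage that supports this fact


@dataclass
class HybridMemory:
    facts: list = field(default_factory=list)     # graph memory
    patterns: list = field(default_factory=list)  # experience memory
    passages: dict = field(default_factory=dict)  # passage memory (raw text)

    def add(self, passage_id, text, facts):
        """Store the original passage and the facts extracted from it."""
        self.passages[passage_id] = text
        self.facts.extend(facts)

    def consolidate(self, min_count=2):
        """Toy abstraction step: keep (subject, relation) pairs seen repeatedly."""
        counts = Counter((f.subject, f.relation) for f in self.facts)
        self.patterns = [
            f"{s} {r} (seen {n} times)"
            for (s, r), n in counts.items()
            if n >= min_count
        ]

    def retrieve(self, query, k=2):
        """Dual-channel retrieval: structured facts plus their source passages."""
        terms = set(query.lower().split())

        def score(f):
            words = {f.subject.lower(), f.relation.lower(), f.obj.lower()}
            return len(terms & words)

        # Channel 1: rank matching facts by overlap, newest first on ties.
        matching = [f for f in self.facts if score(f) > 0]
        matching.sort(key=lambda f: (score(f), f.timestamp), reverse=True)
        top = matching[:k]

        # Channel 2: fetch the original evidence behind each selected fact.
        seen, evidence = set(), []
        for f in top:
            if f.passage_id not in seen:
                seen.add(f.passage_id)
                evidence.append(self.passages[f.passage_id])
        return top, evidence


def build_context(facts, evidence):
    """Assemble a compact prompt context from both channels."""
    lines = [f"[t={f.timestamp}] {f.subject} {f.relation} {f.obj}" for f in facts]
    lines += [f"evidence: {p}" for p in evidence]
    return "\n".join(lines)
```

In this sketch, every retrieved fact carries a `passage_id`, so an answer can be traced back to the text it came from, and the timestamp tie-break is one simple way to prefer the most recent of two conflicting facts.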

Juexiang Ye, Xue Li, Xinyu Yang, Chengkai Huang, Lanshun Nie, Lina Yao, Dechen Zhan • 2026

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Conversational Question Answering | LoCoMo Multi-Hop | F1 Score | 26 | 20 |
| Conversational Question Answering | LoCoMo Temporal | F1 Score | 50.83 | 20 |
| Conversational Question Answering | LoCoMo Open-Domain | F1 | 20.73 | 20 |
| Conversational Question Answering | LoCoMo Overall | Avg Rank (F1) | 1 | 20 |
| Long-horizon Question Answering | LoCoMo Multi-Hop 1.0 | EM | 426 | 20 |
| Long-horizon Question Answering | LoCoMo Temporal 1.0 | EM | 1.12e+3 | 20 |
| Long-horizon Question Answering | LoCoMo Open-Domain 1.0 | EM | 7.29 | 20 |
| Long-horizon Question Answering | LoCoMo Single-Hop 1.0 | EM | 16.77 | 20 |
| Long-horizon Question Answering | Locomo | Multi-Hop RGE-L | 0.2568 | 20 |
| Conversational Question Answering | LoCoMo Single-Hop | F1 Score | 39.2 | 20 |
Showing 10 of 12 rows
