Pre-Storage Reasoning for Episodic Memory: Shifting Inference Burden to Memory for Personalized Dialogue

About

Effective long-term memory in conversational AI requires synthesizing information across multiple sessions. However, current systems place an excessive reasoning burden on response generation, making performance heavily dependent on model size. We introduce PREMem (Pre-storage Reasoning for Episodic Memory), a novel approach that shifts complex reasoning from inference to memory construction. PREMem extracts fine-grained memory fragments categorized as factual, experiential, or subjective information; it then establishes explicit relationships between memory items across sessions, capturing evolution patterns such as extensions, transformations, and implications. By performing this reasoning at storage time rather than at response generation, PREMem creates enriched representations while reducing computational demands during interaction. Experiments show significant performance improvements across all model sizes, with smaller models achieving results comparable to much larger baselines and maintaining effectiveness even under constrained token budgets. Code and dataset are available at https://github.com/sangyeop-kim/PREMem.
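To make the pre-storage flow concrete, here is a minimal sketch of the pipeline the abstract describes: extract categorized fragments per session, then link each new fragment to earlier memory before anything is stored. This is not the paper's implementation; the `extract` and `relate` callables (which would wrap LLM prompts in practice) and all class and field names are hypothetical.

```python
from dataclasses import dataclass, field
from enum import Enum

# Fragment categories and cross-session relation types, as named in the abstract.
class Category(Enum):
    FACTUAL = "factual"
    EXPERIENTIAL = "experiential"
    SUBJECTIVE = "subjective"

class Relation(Enum):
    EXTENSION = "extension"
    TRANSFORMATION = "transformation"
    IMPLICATION = "implication"

@dataclass
class Fragment:
    session_id: int
    text: str
    category: Category
    links: list = field(default_factory=list)  # (Relation, Fragment) pairs to earlier items

def build_memory(sessions, extract, relate):
    """Pre-storage reasoning sketch: `extract(dialogue)` yields
    (text, Category) pairs and `relate(prior, new)` returns a Relation
    or None; both stand in for LLM calls and are assumptions."""
    store: list[Fragment] = []
    for sid, dialogue in enumerate(sessions):
        for text, category in extract(dialogue):   # fine-grained extraction
            frag = Fragment(sid, text, category)
            for prior in store:                    # explicit cross-session linking
                rel = relate(prior, frag)
                if rel is not None:
                    frag.links.append((rel, prior))
            store.append(frag)
    return store  # enriched memory; response generation only retrieves from it
```

Under this reading, the expensive reasoning happens once per stored fragment, so at inference time even a small model can answer by retrieving already-linked memory rather than re-deriving cross-session relationships.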

Sangyeop Kim, Yohan Lee, Sanghwa Kim, Hyunjong Kim, Sungzoon Cho • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| --- | --- | --- | --- | --- |
| Question Answering | NarrativeQA (test) | ROUGE-L | 11.64 | 61 |
| Long-context Memory Retrieval | LoCoMo | Single-hop | 66.2 | 55 |
| Question Answering | WikiHop (test) | Accuracy | 45.87 | 32 |
| Long-term Memory Evaluation | LongMemEval-S (test) | KU (Knowledge Update) | 84.6 | 27 |
| Question Answering | HotpotQA (test) | Accuracy | 87.7 | 24 |
| Question Answering | Merged QA (HotpotQA, NarrativeQA, WikiHop) (test) | Accuracy | 45.81 | 24 |
| Long-context Question Answering | LoCoMo | Single-Hop LLJ Score | 66.2 | 24 |
| Long-term Memory Retrieval | LongMemEval-S | SSU | 92.9 | 9 |
