Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning

About

Large Language Models (LLMs) are increasingly deployed as long-term interactive agents, yet their limited context windows make it difficult to sustain coherent behavior over extended interactions. Existing memory systems often store isolated records and retrieve fragments, limiting their ability to consolidate evolving user states and resolve conflicts. We introduce EverMemOS, a self-organizing memory operating system that implements an engram-inspired lifecycle for computational memory. Episodic Trace Formation converts dialogue streams into MemCells that capture episodic traces, atomic facts, and time-bounded Foresight signals. Semantic Consolidation organizes MemCells into thematic MemScenes, distilling stable semantic structures and updating user profiles. Reconstructive Recollection performs MemScene-guided agentic retrieval to compose the necessary and sufficient context for downstream reasoning. Experiments on LoCoMo and LongMemEval show that EverMemOS achieves state-of-the-art performance on memory-augmented reasoning tasks. We further report a profile study on PersonaMem v2 and qualitative case studies illustrating chat-oriented capabilities such as user profiling and Foresight. Code is available at https://github.com/EverMind-AI/EverMemOS.

Chuanrui Hu, Xingze Gao, Zuyi Zhou, Dannong Xu, Yi Bai, Xintong Li, Hui Zhang, Tong Li, Chong Zhang, Lidong Bing, Yafeng Deng• 2026

Related benchmarks

TaskDatasetResultRank
Long-term memory evaluationLocomo
Overall F192.3
128
Long-context Memory RetrievalLocomo
Single-hop96.1
80
Long-context ReasoningLocomo
Average F193.05
75
Long-term Memory RetrievalLongMemEval-S
SSU97.1
19
Long-term Conversational Memory Question AnsweringLoCoMo (test)
Single Hop F153.79
15
Conversational Memory Question AnsweringLocomo
Single-Hop J-Score91.08
10
Long-term Memory Question AnsweringLoCoMo 0.6.0
Accuracy94.48
10
Long-term Memory Question AnsweringREALTALK (Overall)
Multi-hop F135.78
7
Long-term Memory Question AnsweringLongMemEvalS
SSU F178.03
7
Long-context Memory EvaluationLongMemEval v0.6.0 (strict)
Accuracy83
7
Showing 10 of 12 rows

Other info

Follow for update