Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SimpleMem: Efficient Lifelong Memory for LLM Agents

About

To support long-term interaction in complex environments, LLM agents require memory systems that manage historical experiences. Existing approaches either retain full interaction histories via passive context extension, leading to substantial redundancy, or rely on iterative reasoning to filter noise, incurring high token costs. To address this challenge, we introduce SimpleMem, an efficient memory framework based on semantic lossless compression. We propose a three-stage pipeline designed to maximize information density and token utilization: (1) Semantic Structured Compression, which distills unstructured interactions into compact, multi-view indexed memory units; (2) Online Semantic Synthesis, an intra-session process that instantly integrates related context into unified abstract representations to eliminate redundancy; and (3) Intent-Aware Retrieval Planning, which infers search intent to dynamically determine retrieval scope and construct precise context efficiently. Experiments on benchmark datasets show that our method consistently outperforms baseline approaches in accuracy, retrieval efficiency, and inference cost, achieving an average F1 improvement of 26.4% in LoCoMo while reducing inference-time token consumption by up to 30-fold, demonstrating a superior balance between performance and efficiency. Code is available at https://github.com/aiming-lab/SimpleMem.

Jiaqi Liu, Yaofeng Su, Peng Xia, Siwei Han, Zeyu Zheng, Cihang Xie, Mingyu Ding, Huaxiu Yao• 2026

Related benchmarks

TaskDatasetResultRank
Long-term memory evaluationLocomo
Overall F143.2
119
Interactive Decision-makingAlfWorld
Overall Success Rate29.7
118
Long-context Question AnsweringLocomo
F1 (Multi Hop)43.46
109
Embodied TaskAlfWorld
Overall Success Rate62.5
96
Mathematical ReasoningAMC 2023
Pass@173.1
67
Interactive web-based shopping tasksWebshop
Score33.2
60
Long-context Memory EvaluationMem-Gallery
F1 Score53.8
35
Long-term Question AnsweringLocomo
Multi-Hop F143.46
27
Agentic ReasoningALFWorld (test)
Success Rate62.5
21
Mathematical ReasoningOmni-MATH
Algebra Accuracy33.9
16
Showing 10 of 13 rows

Other info

GitHub

Follow for update