Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PlugMem: A Task-Agnostic Plugin Memory Module for LLM Agents

About

Long-term memory is essential for large language model (LLM) agents operating in complex environments, yet existing memory designs are either task-specific and non-transferable, or task-agnostic but less effective due to low task-relevance and context explosion from raw memory retrieval. We propose PlugMem, a task-agnostic plugin memory module that can be attached to arbitrary LLM agents without task-specific redesign. Motivated by the fact that decision-relevant information is concentrated as abstract knowledge rather than raw experience, we draw on cognitive science to structure episodic memories into a compact, extensible knowledge-centric memory graph that explicitly represents propositional and prescriptive knowledge. This representation enables efficient memory retrieval and reasoning over task-relevant knowledge, rather than verbose raw trajectories, and departs from other graph-based methods like GraphRAG by treating knowledge as the unit of memory access and organization instead of entities or text chunks. We evaluate PlugMem unchanged across three heterogeneous benchmarks (long-horizon conversational question answering, multi-hop knowledge retrieval, and web agent tasks). The results show that PlugMem consistently outperforms task-agnostic baselines and exceeds task-specific memory designs, while also achieving the highest information density under a unified information-theoretic analysis. Code and data are available at https://github.com/TIMAN-group/PlugMem.

Ke Yang, Zixi Chen, Xuan He, Jize Jiang, Michel Galley, Chenglong Wang, Jianfeng Gao, Jiawei Han, ChengXiang Zhai• 2026

Related benchmarks

TaskDatasetResultRank
Long-term dialogue memoryLongMemEval (test)
Accuracy75.1
18
Multi-hop Question AnsweringHotpotQA Full
EM61.4
9
Web navigationWebArena (online)
Shopping Success Rate52.6
5
Web navigationWebArena (offline)
Shopping Success Rate58.4
5
Showing 4 of 4 rows

Other info

Follow for update