Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Self-Evolving Multi-Agent Systems via Decentralized Memory

About

Self-evolving multi-agent systems (MAS) have emerged as a promising route to LLM agents that continually improve from experience, with persistent memory at their foundation. However, existing designs almost exclusively adopt a centralized repository shared across agents, incurring communication and coordination overhead, raising privacy concerns, and collapsing agent diversity. We propose DecentMem, a decentralized memory framework in which each agent maintains its own dual-pool memory -- an exploitation pool of consolidated past trajectories and an exploration pool of LLM-generated candidates for unseen contexts. The two pools are reweighted online based on stage-wise feedback from an LLM-as-a-judge. Theoretically, we prove that this design guarantees global reachability of the solution space and achieves $O(\log T)$ cumulative regret, matching the stochastic bandit lower bound up to constants. In practice, across three MAS frameworks (AutoGen, DyLAN, AgentNet), three Qwen3 backbones (4B/8B/14B), two Gemma4 backbones (E2B/E4B) and five benchmarks spanning math, code, QA, and embodied tasks, DecentMem improves average accuracy by up to 23.8% over the strongest centralized memory baseline and by up to 52.5% over the no-memory baseline, while reducing token usage by up to 49%.

Guangya Hao, Yunbo Long, Zhuokai Zhao• 2026

Related benchmarks

TaskDatasetResultRank
Interactive Decision-makingAlfWorld
Overall Success Rate93.13
295
Code GenerationMBPP+
Accuracy84.74
236
General ReasoningBBH
Accuracy68.09
190
Mathematical ReasoningAIME 24/25
Accuracy43.33
171
Embodied Task CompletionAlfWorld
Success Rate92.28
96
Complex ReasoningBBH
Accuracy90.5
85
Showing 6 of 6 rows

Other info

Follow for update