Memory Intelligence Agent

About

Deep research agents (DRAs) integrate LLM reasoning with external tools. Memory systems enable DRAs to leverage historical experiences, which are essential for efficient reasoning and autonomous evolution. Existing methods rely on retrieving similar trajectories from memory to aid reasoning, while suffering from key limitations of ineffective memory evolution and increasing storage and retrieval costs. To address these problems, we propose a novel Memory Intelligence Agent (MIA) framework, consisting of a Manager-Planner-Executor architecture. Memory Manager is a non-parametric memory system that can store compressed historical search trajectories. Planner is a parametric memory agent that can produce search plans for questions. Executor is another agent that can search and analyze information guided by the search plan. To build the MIA framework, we first adopt an alternating reinforcement learning paradigm to enhance cooperation between the Planner and the Executor. Furthermore, we enable the Planner to continuously evolve during test-time learning, with updates performed on-the-fly alongside inference without interrupting the reasoning process. Additionally, we establish a bidirectional conversion loop between parametric and non-parametric memories to achieve efficient memory evolution. Finally, we incorporate a reflection and an unsupervised judgment mechanisms to boost reasoning and self-evolution in the open world. Extensive experiments across eleven benchmarks demonstrate the superiority of MIA.

Jingyang Qiao, Weicheng Meng, Yu Cheng, Zhihang Lin, Zhizhong Zhang, Xin Tan, Jingyu Gong, Kun Shao, Yuan Xie• 2026

Related benchmarks

Task	Dataset	Result
Visual Question Answering	SimpleVQA	Accuracy0.649	225
Visual Question Answering	LiveVQA	Accuracy43.1	151
Multimodal Search	MMSearch	Accuracy62.6	119
Visual Question Answering	InfoSeek	Accuracy65.5	77
Visual Question Answering	FVQA (test)	Accuracy69.6	51
Multimodal Research	In-house 2	Accuracy37.7	18
Multimodal Research	In-house 1	Accuracy31.8	18
Question Answering	SimpleQA out-domain (test)	--	11
Question Answering	2Wiki Out-of-Domain (test)	Accuracy71.8	9
Question Answering	HotpotQA Out-of-Domain (test)	Accuracy63.5	9

Showing 10 of 11 rows

Other info

GitHub

Follow for update

@wizwand_team Discord