GAM-RAG: Gain-Adaptive Memory for Evolving Retrieval in Retrieval-Augmented Generation

About

Retrieval-Augmented Generation (RAG) grounds large language models with external evidence, but many implementations rely on pre-built indices that remain static after construction. Related queries therefore repeat similar multi-hop traversal, increasing latency and compute. Motivated by schema-based learning in cognitive neuroscience, we propose GAM-RAG, a training-free framework that accumulates retrieval experience from recurring or related queries and updates retrieval memory over time. GAM-RAG builds a lightweight, relation-free hierarchical index whose links capture potential co-occurrence rather than fixed semantic relations. During inference, successful retrieval episodes provide sentence-level feedback, updating sentence memories so evidence useful for similar reasoning types becomes easier to activate later. To balance stability and adaptability under noisy feedback, we introduce an uncertainty-aware, Kalman-inspired gain rule that jointly updates memory states and perplexity-based uncertainty estimates. It applies fast updates for reliable novel signals and conservative refinement for stable or noisy memories. We provide a theoretical analysis of the update dynamics, and empirically show that GAM-RAG improves average performance by 3.95% over the strongest baseline and by 8.19% with 5-turn memory, while reducing inference cost by 61%. Our code and datasets are available at: https://anonymous.4open.science/r/GAM_RAG-2EF6.

Yifan Wang, Mingxuan Jiang, Zhihao Sun, Yixin Cao, Yicun Liu, Keyang Chen, Guangnan Ye, Hongfeng Chai• 2026

Related benchmarks

Task	Dataset	Result
Question Answering	2Wiki	--	241
Question Answering	MuSiQue	LLM Accuracy45.8	34
Question Answering	medical	GPT Accuracy68.81	31
Question Answering	HotpotQA	GPT Accuracy76.2	14
Question Answering	TimeQA	GPT Accuracy50.32	14
Retrieval-Augmented Generation	2Wiki Same Query	GPT Accuracy71.8	9
Retrieval-Augmented Generation	2Wiki Similar Query	GPT Accuracy71	9
Retrieval-Augmented Generation	2Wiki Different Query	GPT Accuracy69.8	9
Indexing	2Wiki	Tokens (M)1.66	7

Showing 9 of 9 rows

Other info

Follow for update

@wizwand_team Discord