Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Retrieval as Generation: A Unified Framework with Self-Triggered Information Planning

About

We revisit retrieval-augmented generation (RAG) by embedding retrieval control directly into generation. Instead of treating retrieval as an external intervention, we express retrieval decisions within token-level decoding, enabling end-to-end coordination without additional controllers or classifiers. Under the paradigm of Retrieval as Generation, we propose \textbf{GRIP} (\textbf{G}eneration-guided \textbf{R}etrieval with \textbf{I}nformation \textbf{P}lanning), a unified framework in which the model regulates retrieval behavior through control-token emission. Central to GRIP is \textit{Self-Triggered Information Planning}, which allows the model to decide when to retrieve, how to reformulate queries, and when to terminate, all within a single autoregressive trajectory. This design tightly couples retrieval and reasoning and supports dynamic multi-step inference with on-the-fly evidence integration. To supervise these behaviors, we construct a structured training set covering answerable, partially answerable, and multi-hop queries, each aligned with specific token patterns. Experiments on five QA benchmarks show that GRIP surpasses strong RAG baselines and is competitive with GPT-4o while using substantially fewer parameters.

Bo Li, Mingda Wang, Gexiang Fang, Shikun Zhang, Wei Ye• 2026

Related benchmarks

TaskDatasetResultRank
Question AnsweringWebQ
EM32
27
Question AnsweringPopQA
EM38.6
17
Question AnsweringTriviaQA
EM57.9
17
Question AnsweringTriviaQA
EM51.9
14
Question AnsweringHotpotQA
EM32
10
Question AnsweringPopQA
EM28.7
10
Question AnsweringNQ
EM21.2
10
Biomedical Question AnsweringBioASQ (test)
ROUGE54.8
8
Question AnsweringHotpotQA
CoverEM45
4
Question AnsweringPopQA
CoverEM49.1
4
Showing 10 of 14 rows

Other info

Follow for update