Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Agentic-R: Learning to Retrieve for Agentic Search

About

Agentic search has recently emerged as a powerful paradigm, where an agent interleaves multi-step reasoning with on-demand retrieval to solve complex questions. Despite its success, how to design a retriever for agentic search remains largely underexplored. Existing search agents typically rely on similarity-based retrievers, while similar passages are not always useful for final answer generation. In this paper, we propose a novel retriever training framework tailored for agentic search. Unlike retrievers designed for single-turn retrieval-augmented generation (RAG) that only rely on local passage utility, we propose to use both local query-passage relevance and global answer correctness to measure passage utility in a multi-turn agentic search. We further introduce an iterative training strategy, where the search agent and the retriever are optimized bidirectionally and iteratively. Different from RAG retrievers that are only trained once with fixed questions, our retriever is continuously improved using evolving and higher-quality queries from the agent. Extensive experiments on seven single-hop and multi-hop QA benchmarks demonstrate that our retriever, termed \ours{}, consistently outperforms strong baselines across different search agents. Our codes are available at: https://github.com/8421BCD/Agentic-R.

Wenhan Liu, Xinyu Ma, Yutao Zhu, Yuchen Li, Daiting Shi, Dawei Yin, Zhicheng Dou• 2026

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM49.07
278
Multi-hop Question AnsweringMuSiQue
EM22.54
106
Multi-hop Question AnsweringBamboogle
Exact Match48
97
Multi-hop Question AnsweringHotpotQA
Exact Match (EM)47.68
56
General Question AnsweringTriviaQA
Exact Match69.02
39
General Question AnsweringPopQA
EM44.14
36
General Question AnsweringNQ
Exact Match (EM)42.43
36
Question AnsweringCombined 7 Datasets
Average Score45
18
Showing 8 of 8 rows

Other info

Follow for update