Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning

About

Agentic retrieval-augmented generation (RAG) systems enable large language models (LLMs) to solve complex tasks through multi-step interaction with external retrieval tools. However, such multi-step interaction often involves redundant search steps, incurring substantial computational cost and latency. Prior work limits search depth (i.e., the number of search steps) to reduce cost, but this often leads to underexploration of complex questions. To address this, we first investigate how search depth affects accuracy and find a minimal sufficient search depth that defines an accuracy-efficiency trade-off, jointly determined by question complexity and the agent's capability. Furthermore, we propose AutoSearch, a reinforcement learning (RL) framework that evaluates each search step via self-generated intermediate answers. By a self-answering mechanism, AutoSearch identifies the minimal sufficient search depth and promotes efficient search by rewarding its attainment while penalizing over-searching. In addition, reward mechanisms are introduced to stabilize search behavior and improve answer quality on complex questions. Extensive experiments on multiple benchmarks show that AutoSearch achieves a superior accuracy-efficiency trade-off, alleviating over-searching while preserving search quality.

Jingbo Sun, Wenyue Chong, Songjun Tu, Qichao Zhang, Yaocheng Zhang, Jiajun Chai, Xiaohan Wang, Wei Lin, Guojun Yin, Dongbin Zhao• 2026

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM43.6
559
Multi-hop Question AnsweringHotpotQA
Exact Match (EM)42.7
66
Multi-hop Question AnsweringBamboogle
Exact Match (EM)41.9
55
Question AnsweringTriviaQA
EM65.8
13
Question AnsweringPopQA
Exact Match (EM)46.3
13
Question AnsweringNQ
EM47.6
13
Multi-hop Question AnsweringBamboogle
OSR10.1
13
Multi-hop Question AnsweringHotpotQA
OSR Score5.78
13
General Question AnsweringNQ (Natural Questions)
OSR (%)1.83
13
General Question AnsweringTriviaQA
OSR (%)0.51
13
Showing 10 of 11 rows

Other info

Follow for update