
Scaling Retrieval-Augmented Reasoning with Parallel Search and Explicit Merging

About

Deep search agents have proven effective at enhancing LLMs by retrieving external knowledge during multi-step reasoning. However, existing methods often generate a single query per reasoning step, which limits information coverage and introduces noise. The resulting low signal-to-noise ratio (SNR) during search can degrade reasoning accuracy and lead to unnecessary reasoning steps. In this paper, we introduce MultiSearch, an RL-based framework that addresses these limitations through multi-query retrieval and explicit merging of retrieved information. At each reasoning step, MultiSearch generates queries from multiple perspectives and retrieves external information in parallel, expanding the scope of relevant information and reducing reliance on any single retrieval result. The agent then consolidates and refines the retrieved information in a merging step, improving the SNR and supporting more accurate reasoning. Additionally, we propose a reinforcement learning framework with a multi-process reward design to optimize agents for both multi-query retrieval and information consolidation. Extensive experiments on seven benchmarks demonstrate that MultiSearch outperforms baseline methods, enhancing the SNR of retrieval and improving reasoning performance on question-answering tasks.
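The retrieve-then-merge step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy corpus, the word-overlap retriever `toy_retrieve`, and the function names `parallel_search` and `merge` are all hypothetical. In particular, the abstract says the agent itself consolidates and refines the retrieved text, whereas the stand-in below only removes duplicate passages.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Hypothetical toy corpus standing in for an external search engine.
CORPUS = [
    "Paris is the capital of France.",
    "The Eiffel Tower is located in Paris.",
    "Berlin is the capital of Germany.",
    "The Eiffel Tower was completed in 1889.",
]


def toy_retrieve(query: str, k: int = 2) -> List[str]:
    """Return up to k passages ranked by word overlap (illustrative only)."""
    query_words = set(query.lower().split())
    scored = []
    for doc in CORPUS:
        doc_words = set(doc.lower().rstrip(".").split())
        overlap = len(query_words & doc_words)
        if overlap > 0:
            scored.append((overlap, doc))
    # list.sort is stable, so ties keep corpus order.
    scored.sort(key=lambda pair: -pair[0])
    return [doc for _, doc in scored[:k]]


def parallel_search(
    queries: List[str],
    retrieve: Callable[[str], List[str]] = toy_retrieve,
) -> List[List[str]]:
    """Issue every query of one reasoning step concurrently."""
    with ThreadPoolExecutor(max_workers=max(1, len(queries))) as pool:
        # pool.map preserves the order of the input queries.
        return list(pool.map(retrieve, queries))


def merge(results: List[List[str]]) -> List[str]:
    """Assumed stand-in for the merging step: drop duplicate passages,
    keeping first-seen order. A learned agent would also refine the text."""
    seen = set()
    merged = []
    for docs in results:
        for doc in docs:
            if doc not in seen:
                seen.add(doc)
                merged.append(doc)
    return merged


if __name__ == "__main__":
    # Several phrasings of one information need, as a multi-query step might emit.
    step_queries = [
        "capital of France",
        "Eiffel Tower Paris",
        "Eiffel Tower location",
    ]
    for passage in merge(parallel_search(step_queries)):
        print(passage)
```

Issuing several phrasings widens coverage, and merging keeps the overlapping hits from being passed to the reasoning step more than once, which is the intuition behind the SNR argument above.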

Jiabei Liu, Wenyu Mao, Junfei Tan, Chunxu Shen, Lingling Yi, Jiancan Wu, Xiang Wang • 2026

Related benchmarks

| Task | Dataset | Result | Rank |
| Multi-hop Question Answering | MuSiQue | EM 16.3 | 209 |
| Single-hop Question Answering | PopQA | EM 45.5 | 186 |
| Single-hop Question Answering | TriviaQA | EM 63 | 133 |
| Multi-hop Question Answering | HotpotQA | Exact Match (EM) 44.6 | 66 |
| Multi-hop Question Answering | Bamboogle | Accuracy 47.6 | 44 |
| Multi-hop Question Answering | 2Wiki | Accuracy (2Wiki) 41.6 | 44 |
| Single-hop Question Answering | NQ | Accuracy 49.1 | 37 |
| Multi-hop Question Answering | HotpotQA | EM 43.1 | 3 |
| Multi-hop Question Answering | 2Wiki | EM 41.3 | 3 |
| Multi-hop Question Answering | Bamboogle | EM 0.39 | 3 |
