Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Scent of Knowledge: Optimizing Search-Enhanced Reasoning with Information Foraging

About

Augmenting large language models (LLMs) with external retrieval has become a standard method to address their inherent knowledge cutoff limitations. However, traditional retrieval-augmented generation methods employ static, pre-inference retrieval strategies, making them inadequate for complex tasks involving ambiguous, multi-step, or evolving information needs. Recent advances in test-time scaling techniques have demonstrated significant potential in enabling LLMs to dynamically interact with external tools, motivating the shift toward adaptive inference-time retrieval. Inspired by Information Foraging Theory (IFT), we propose InForage, a reinforcement learning framework that formalizes retrieval-augmented reasoning as a dynamic information-seeking process. Unlike existing approaches, InForage explicitly rewards intermediate retrieval quality, encouraging LLMs to iteratively gather and integrate information through adaptive search behaviors. To facilitate training, we construct a human-guided dataset capturing iterative search and reasoning trajectories for complex, real-world web tasks. Extensive evaluations across general question answering, multi-hop reasoning tasks, and a newly developed real-time web QA dataset demonstrate InForage's superior performance over baseline methods. These results highlight InForage's effectiveness in building robust, adaptive, and efficient reasoning agents.

Hongjin Qian, Zheng Liu• 2025

Related benchmarks

TaskDatasetResultRank
Multi-hop Question Answering2WikiMultihopQA
EM42.8
559
Multi-hop Question AnsweringHotpotQA (test)--
311
Multi-hop Question Answering2WikiMultiHopQA (test)
EM42.8
226
Question AnsweringPopQA--
186
Question AnsweringHotpotQA
EM40.9
173
Multi-hop Question AnsweringHotpotQA
Exact Match (EM)40.9
150
Multi-hop Question AnsweringBamboogle
Exact Match36
128
Multi-hop Question AnsweringMuSiQue (test)--
128
Question AnsweringTriviaQA--
117
Question Answering2WikiMultihopQA
EM42.8
107
Showing 10 of 22 rows

Other info

Follow for update