Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization

About

Recent deep research agents primarily improve performance by scaling reasoning depth, but this leads to high inference cost and latency in search-intensive scenarios. Moreover, generalization across heterogeneous research settings remains challenging. In this work, we propose \emph{Search More, Think Less} (SMTL), a framework for long-horizon agentic search that targets both efficiency and generalization. SMTL replaces sequential reasoning with parallel evidence acquisition, enabling efficient context management under constrained context budgets. To support generalization across task types, we further introduce a unified data synthesis pipeline that constructs search tasks spanning both deterministic question answering and open-ended research scenarios with task appropriate evaluation metrics. We train an end-to-end agent using supervised fine-tuning and reinforcement learning, achieving strong and often state of the art performance across benchmarks including BrowseComp (48.6\%), GAIA (75.7\%), Xbench (82.0\%), and DeepResearch Bench (45.9\%). Compared to Mirothinker-v1.0, SMTL with maximum 100 interaction steps reduces the average number of reasoning steps on BrowseComp by 70.7\%, while improving accuracy.

Qianben Chen, Tianrui Qin, King Zhu, Qiexiang Wang, Chengjun Yu, Shu Xu, Jiaqi Wu, Jiayu Zhang, Xinpeng Liu, Xin Gui, Jingyi Cao, Piaohong Wang, Dingfeng Shi, He Zhu, Tiannan Wang, Yuqing Wang, Maojia Song, Tianyu Zheng, Ge Zhang, Jian Yang, Jiaheng Liu, Minghao Liu, Yuchen Eleanor Jiang, Wangchunshu Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Deep searchBrowse Comp
Score48.6
38
Web Navigation Question AnsweringWebWalker QA
Accuracy76.5
23
Deep searchxBench DeepSearch DS-2505
Score82
20
General AI AssistantGAIA Text-Only
Score75.7
19
Deep searchBrowseComp en
Score43.6
15
Deep searchxbench DeepSearch
Score80
7
Showing 6 of 6 rows

Other info

GitHub

Follow for update