WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking

About

Large Language Model (LLM)-based agents have emerged as a transformative approach for open-ended problem solving, with information seeking (IS) being a core capability that enables autonomous reasoning and decision-making. While prior research has largely focused on improving retrieval depth, we observe that current IS agents often suffer from low search efficiency, which in turn constrains overall performance. A key factor underlying this inefficiency is the sparsity of target entities in training tasks, which limits opportunities for agents to learn and generalize efficient search behaviors. To address these challenges, we propose WebLeaper, a framework for constructing high-coverage IS tasks and generating efficient solution trajectories. We formulate IS as a tree-structured reasoning problem, enabling a substantially larger set of target entities to be embedded within a constrained context. Leveraging curated Wikipedia tables, we propose three variants for synthesizing IS tasks, Basic, Union, and Reverse-Union, to systematically increase both IS efficiency and efficacy. Finally, we curate training trajectories by retaining only those that are simultaneously accurate and efficient, ensuring that the model is optimized for both correctness and search performance. Extensive experiments on both basic and comprehensive settings, conducted on five IS benchmarks, BrowserComp, GAIA, xbench-DeepSearch, WideSearch, and Seal-0, demonstrate that our method consistently achieves improvements in both effectiveness and efficiency over strong baselines.

Zhengwei Tao, Haiyang Shen, Baixuan Li, Wenbiao Yin, Jialong Wu, Kuan Li, Zhongwang Zhang, Huifeng Yin, Rui Ye, Liwen Zhang, Xinyu Wang, Pengjun Xie, Jingren Zhou, Yong Jiang• 2025

Related benchmarks

Task	Dataset	Result
Broad Information Seeking	WideSearch	--	34
Deep Information Search and Synthesis	xbench DeepSearch	Score72	22
Multi-step navigation and information location	BrowseComp English	Score27.7	22
Information Seeking	BrowseComp standard (full)	Pass@123	20
Agent Capability Evaluation	SEAL 0	--	19
Information Seeking	XBench 2505 (full)	pass@166	17
Information Seeking	GAIA 103-question text-only	Pass@167	16
General AI Assistant Task Completion	GAIA Text-Only	Accuracy0.732	15
Web agent task completion	HLE	Accuracy29.8	15
Web agent task completion	Browsecomp	Accuracy38.8	15

Showing 10 of 14 rows

Other info

Follow for update

@wizwand_team Discord