Nested Browser-Use Learning for Agentic Information Seeking

About

Information-seeking (IS) agents have achieved strong performance across a range of wide and deep search tasks, yet their tool use remains largely restricted to API-level snippet retrieval and URL-based page fetching, limiting access to the richer information available through real browsing. While full browser interaction could unlock deeper capabilities, the fine-grained control it requires and the verbose page content it returns introduce substantial complexity for ReAct-style function-calling agents. To bridge this gap, we propose Nested Browser-Use Learning (NestBrowse), which introduces a minimal and complete browser-action framework that decouples interaction control from page exploration through a nested structure. This design simplifies agentic reasoning while enabling effective deep-web information acquisition. Empirical results on challenging deep IS benchmarks demonstrate that NestBrowse offers clear benefits in practice. Further in-depth analyses underscore its efficiency and flexibility.

Baixuan Li, Jialong Wu, Wenbiao Yin, Kuan Li, Zhongwang Zhang, Huifeng Yin, Zhengwei Tao, Liwen Zhang, Pengjun Xie, Jingren Zhou, Yong Jiang • 2025

Related benchmarks

Task	Dataset	Result
Multi-hop Question Answering	MuSiQue	EM 24.4	50
Deep Research	xbench	Accuracy 74	30
Multi-hop Question Answering	SQuAD	Exact Match (EM) 19.2	30
Web Browsing	BC-plus	EM 11.2	30
Deep Research Task	Browsecomp	Accuracy 22.4	29
Deep Research	GAIA	Accuracy 68.9	24
Information Seeking	BrowseComp standard (full)	Pass@1 31.6	20
Information Seeking	BrowseComp Chinese (full)	Pass@1 42.6	19
Deep Research	BrowseComp-ZH	Accuracy 28.4	18
Information Seeking	XBench 2505 (full)	pass@1 75	17

Showing 10 of 13 rows
