Nested Browser-Use Learning for Agentic Information Seeking
About
Information-seeking (IS) agents have achieved strong performance across a range of wide and deep search tasks, yet their tool use remains largely restricted to API-level snippet retrieval and URL-based page fetching, limiting access to the richer information available through real browsing. While full browser interaction could unlock deeper capabilities, its fine-grained control and verbose page content returns introduce substantial complexity for ReAct-style function-calling agents. To bridge this gap, we propose Nested Browser-Use Learning (NestBrowse), which introduces a minimal and complete browser-action framework that decouples interaction control from page exploration through a nested structure. This design simplifies agentic reasoning while enabling effective deep-web information acquisition. Empirical results on challenging deep IS benchmarks demonstrate that NestBrowse offers clear benefits in practice. Further in-depth analyses underscore its efficiency and flexibility.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Deep Research | xbench | Accuracy74 | 30 | |
| Deep Research Task | Browsecomp | Accuracy22.4 | 29 | |
| Information Seeking | BrowseComp standard (full) | Pass@131.6 | 20 | |
| Information Seeking | BrowseComp Chinese (full) | Pass@142.6 | 19 | |
| Deep Research | BrowseComp-ZH | Accuracy28.4 | 18 | |
| Information Seeking | XBench 2505 (full) | pass@175 | 17 | |
| Information Seeking | GAIA 103-question text-only | Pass@175.7 | 16 | |
| Deep Research | GAIA | Accuracy68.9 | 14 | |
| Information Seeking | XBench v2510 (full) | Pass@145 | 2 |