Nested Browser-Use Learning for Agentic Information Seeking
About
Information-seeking (IS) agents have achieved strong performance across a range of wide and deep search tasks, yet their tool use remains largely restricted to API-level snippet retrieval and URL-based page fetching, limiting access to the richer information available through real browsing. While full browser interaction could unlock deeper capabilities, its fine-grained control and verbose page content returns introduce substantial complexity for ReAct-style function-calling agents. To bridge this gap, we propose Nested Browser-Use Learning (NestBrowse), which introduces a minimal and complete browser-action framework that decouples interaction control from page exploration through a nested structure. This design simplifies agentic reasoning while enabling effective deep-web information acquisition. Empirical results on challenging deep IS benchmarks demonstrate that NestBrowse offers clear benefits in practice. Further in-depth analyses underscore its efficiency and flexibility.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-hop Question Answering | MuSiQue | EM24.4 | 50 | |
| Deep Research | xbench | Accuracy74 | 30 | |
| Multi-hop Question Answering | SQuAD | Exact Match (EM)19.2 | 30 | |
| Web Browsing | BC-plus | EM11.2 | 30 | |
| Deep Research Task | Browsecomp | Accuracy22.4 | 29 | |
| Deep Research | GAIA | Accuracy68.9 | 24 | |
| Information Seeking | BrowseComp standard (full) | Pass@131.6 | 20 | |
| Information Seeking | BrowseComp Chinese (full) | Pass@142.6 | 19 | |
| Deep Research | BrowseComp-ZH | Accuracy28.4 | 18 | |
| Information Seeking | XBench 2505 (full) | pass@175 | 17 |