Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

WebDancer: Towards Autonomous Information Seeking Agency

About

Addressing intricate real-world problems necessitates in-depth information seeking and multi-step reasoning. Recent progress in agentic systems, exemplified by Deep Research, underscores the potential for autonomous multi-step research. In this work, we present a cohesive paradigm for building end-to-end agentic information seeking agents from a data-centric and training-stage perspective. Our approach consists of four key stages: (1) browsing data construction, (2) trajectories sampling, (3) supervised fine-tuning for effective cold start, and (4) reinforcement learning for enhanced generalisation. We instantiate this framework in a web agent based on the ReAct, WebDancer. Empirical evaluations on the challenging information seeking benchmarks, GAIA and WebWalkerQA, demonstrate the strong performance of WebDancer, achieving considerable results and highlighting the efficacy of our training paradigm. Further analysis of agent training provides valuable insights and actionable, systematic pathways for developing more capable agentic models. The codes and demo will be released in https://github.com/Alibaba-NLP/WebAgent.

Jialong Wu, Baixuan Li, Runnan Fang, Wenbiao Yin, Liwen Zhang, Zhengwei Tao, Dingchu Zhang, Zekun Xi, Gang Fu, Yong Jiang, Pengjun Xie, Fei Huang, Jingren Zhou• 2025

Related benchmarks

TaskDatasetResultRank
Deep ResearchBrowseComp-ZH (BC-zh) original (test)
Pass@118
45
Deep-search QABrowseComp (test)
Pass@13.8
24
Deep-search QAXbench-DeepSearch (test)
Pass@139
24
Deep ResearchGAIA text-only original (test)
Pass@151.5
20
Deep ResearchBrowseComp-EN (BC-en) original (test)
Pass@13.8
20
Information SeekingBrowseComp standard (full)
Pass@13.8
20
General AI AssistantGAIA text
GAIA Average Score40.7
19
Information SeekingBrowseComp Chinese (full)
Pass@118
19
Information SeekingBrowsecomp
Success Rate3.8
19
Web Browsing and NavigationWebWalkerQA
Average Accuracy38.4
18
Showing 10 of 27 rows

Other info

Follow for update