Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces

About

We introduce WebChain, the largest open-source dataset of human-annotated trajectories on real-world websites, designed to accelerate reproducible research in web agents. It contains 31,725 trajectories and 318k steps, featuring a core Triple Alignment of visual, structural, and action data to provide rich, multi-modal supervision. The data is collected via a scalable pipeline that ensures coverage of complex, high-value tasks often missed by synthetic methods. Leveraging this dataset, we propose a Dual Mid-Training recipe that decouples spatial grounding from planning, achieving state-of-the-art performance on our proposed WebChainBench and other public GUI benchmarks. Our work provides the data and insights necessary to build and rigorously evaluate the next generation of scalable web agents.

Sicheng Fan, Rui Wan, Yifei Leng, Gaoning Liang, Li Ling, Yanyi Shang, Dehan Kong• 2026

Related benchmarks

TaskDatasetResultRank
GUI Interaction ControlGUI-Odyssey
SR54.8
31
GUI planningAndroidControl Low
SR (%)74.1
31
GUI Interaction ControlAndroidControl High
Type Score86.7
10
GUI Interaction ControlGUI-Act-Web
Type Accuracy96.3
10
GUI Interaction ControlOmniAct Desktop
Type Accuracy99.7
10
GUI Interaction ControlOmniAct-Web
Type Accuracy96.2
10
GUI Interaction ControlOmniAct GUI-Act Aggregate
Overall Score81.4
10
Showing 7 of 7 rows

Other info

Follow for update