InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training
About
GUI agents that interact with graphical interfaces on behalf of users represent a promising direction for practical AI assistants. However, training such agents is hindered by the scarcity of suitable environments. We present InfiniteWeb, a system that automatically generates functional web environments at scale for GUI agent training. While LLMs perform well on generating a single webpage, building a realistic and functional website with many interconnected pages faces challenges. We address these challenges through unified specification, task-centric test-driven development, and a combination of website seed with reference design image to ensure diversity. Our system also generates verifiable task evaluators enabling dense reward signals for reinforcement learning. Experiments show that InfiniteWeb surpasses commercial coding agents at realistic website construction, and GUI agents trained on our generated environments achieve significant performance improvements on OSWorld and Online-Mind2Web, demonstrating the effectiveness of proposed system.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Web Navigation Task Success | MIND2WEB ONLINE (test) | -- | 18 | |
| GUI Agent Task Completion | OSWorld 1.0 (test) | Success Rate (Chrome)36.9 | 9 | |
| Website Generation Functional Correctness | WebGen-Bench | Content Presentation Score0.915 | 4 | |
| Visual Quality Evaluation | WebGen-Bench | Win Rate85.4 | 3 |