Share your thoughts, 1 month free Claude Pro on usSee more

Web-based Agent Reasoning on WebWalkerQA Hard

0.6333Pass@3

ExpSeek

Updated 4mo ago

Evaluation Results

Method	Links
ExpSeek 2026.01		0.6333
ExpSeek 2026.01		0.6333
No Experience 2026.01		0.5833
Training-Free GRPO 2026.01		0.5745
REASONINGBANK+ 2026.01		0.5644
Training-Free GRPO 2026.01		0.5267
REASONINGBANK+ 2026.01		0.5
No Experience 2026.01		0.4611