Web-based Agent Reasoning on WebWalkerQA Easy
Top result: ExpSeek, 72.5 Pass@3 (Jan 13, 2026)
Evaluation Results
Method               Base Model   Date      Pass@3
ExpSeek              Qwen3-32B    2026.01   72.5
ExpSeek              Qwen3-8B     2026.01   69.17
REASONINGBANK+       Qwen3-32B    2026.01   68.33
Training-Free GRPO   Qwen3-32B    2026.01   63.95
No Experience        Qwen3-32B    2026.01   62.5
REASONINGBANK+       Qwen3-8B     2026.01   61.17
No Experience        Qwen3-8B     2026.01   57.5
Training-Free GRPO   Qwen3-8B     2026.01   56.67