Agentic Search on WebWalker
[Chart: Accuracy by date. Highlighted point: LiteResearcher-4B, 72.7 Accuracy, Apr 20, 2026. Metrics shown: Accuracy, Online Deployment Accuracy.]
Updated 1mo ago
Evaluation Results
| Method | Details | Date | Accuracy | Online Deployment Accuracy |
| --- | --- | --- | --- | --- |
| LiteResearcher-4B | Context Window=128k, M... | 2026.04 | 72.7 | - |
| Tongyi DeepResearch 30B | Context Window=128k, M... | 2026.04 | 72.2 | - |
| AgentCPM-Explore-4B | Context Window=128k, M... | 2026.04 | 68.1 | - |
| AFM-RL-32B | Context Window=128k, M... | 2026.04 | 63 | - |
| WebExplorer-8B | Context Window=128k, M... | 2026.04 | 62.7 | - |
| Claude-4-Sonnet | Context Window=128k, M... | 2026.04 | 61.7 | - |
| DeepSeek-V3.1 | Context Window=128k, M... | 2026.04 | 61.2 | - |
| Mirothinker 8B | Context Window=128k, M... | 2026.04 | 60.6 | - |
| WebDancer (QwQ) | Context Window=128k, M... | 2026.04 | 47.9 | - |
| Qwen3-235B-A22B-Instruct | Model Version=Instruct... | 2026.04 | - | 59.5 |
| Qwen3-30B-A3B-Instruct | Model Version=Instruct... | 2026.04 | - | 45 |
| AgenticQwen-8B | Parameters=8B | 2026.04 | - | 50 |
| AgenticQwen-30B-A3B | Parameters=30B-A3B | 2026.04 | - | 52.5 |