Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agentic Search on Xbench DeepSearch 2505
Loading...
78
Accuracy
LiteResearcher-4B
36.712
47.431
58.15
68.869
Apr 20, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
LiteResearcher-4B
Context Window=128k, M...
2026.04
78
OpenAI-GPT-5-high
Context Window=128k, M...
2026.04
77.8
Tongyi DeepResearch 30B
Context Window=128k, M...
2026.04
75
Minimax-M2
Context Window=128k, M...
2026.04
72
DeepSeek-V3.2
Context Window=128k, M...
2026.04
71
DeepSeek-V3.1
Context Window=128k, M...
2026.04
71
GLM-4.6
Context Window=128k, M...
2026.04
70
AgentCPM-Explore-4B
Context Window=128k, M...
2026.04
70
Kimi-Researcher
Context Window=128k, M...
2026.04
69
Claude-4.5-Sonnet
Context Window=128k, M...
2026.04
66
Claude-4-Sonnet
Context Window=128k, M...
2026.04
64.6
DeepMiner-32B
Context Window=128k, M...
2026.04
62
Kimi-K2-0905
Context Window=128k, M...
2026.04
61
Mirothinker 8B
Context Window=128k, M...
2026.04
60.6
WebExplorer-8B
Context Window=128k, M...
2026.04
53.7
WebSailor 30B
Context Window=128k, M...
2026.04
53.3
ASearcher QWQ v2
Context Window=128k, M...
2026.04
51.1
WebDancer (QwQ)
Context Window=128k, M...
2026.04
38.3
Feedback
Search any
task
Search any
task