Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Deep Search on xBench DeepSearch (05)
Loading...
75
Score
Tongyi-DeepResearch-30B
23
36.5
50
63.5
Feb 13, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Tongyi-DeepResearch-30B
Category=Research Agent
2026.02
75
Nanbeige4.1-3B
Category=Ours
2026.02
75
Minimax-M2-230B
Category=Large Foundat...
2026.02
72
DeepSeek-V3.2-671B
Category=Large Foundat...
2026.02
71
AgentCPM-Explore-4B
Category=Research Agent
2026.02
70
GLM-4.6-357B
Category=Large Foundat...
2026.02
70
MiroThinker-v1.0-8B
Category=Research Agent
2026.02
60.6
Qwen3-32B
Category=Small Foundat...
2026.02
39
Qwen3-4B-2507
Category=Small Foundat...
2026.02
34
Qwen3-14B
Category=Small Foundat...
2026.02
34
Nanbeige4-3B-2511
Category=Baseline
2026.02
33
Qwen3-8B
Category=Small Foundat...
2026.02
31
Qwen3-Next-80B-A3B
Category=Small Foundat...
2026.02
27
Qwen3-30B-A3B-2507
Category=Small Foundat...
2026.02
25
Feedback
Search any
task
Search any
task