Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Deep Search on GAIA text-only
Loading...
0.757
Score
Minimax-M2-230B
0.171688
0.323644
0.4756
0.627556
Feb 13, 2026
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Minimax-M2-230B
Category=Large Foundat...
2026.02
0.757
GLM-4.6-357B
Category=Large Foundat...
2026.02
0.719
Tongyi-DeepResearch-30B
Category=Research Agen...
2026.02
0.709
Nanbeige4.1-3B
Category=Ours, Paramet...
2026.02
0.699
MiroThinker-v1.0-8B
Category=Research Agen...
2026.02
0.664
AgentCPM-Explore-4B
Category=Research Agen...
2026.02
0.639
DeepSeek-V3.2-671B
Category=Large Foundat...
2026.02
0.635
Qwen3-Next-80B-A3B
Category=Small Foundat...
2026.02
0.3402
Qwen3-30B-A3B-2507
Category=Small Foundat...
2026.02
0.3163
Qwen3-14B
Category=Small Foundat...
2026.02
0.3023
Qwen3-32B
Category=Small Foundat...
2026.02
0.3017
Qwen3-4B-2507
Category=Small Foundat...
2026.02
0.2833
Qwen3-8B
Category=Small Foundat...
2026.02
0.1953
Nanbeige4-3B-2511
Category=Baseline, Par...
2026.02
0.1942
Feedback
Search any
task
Search any
task