Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
General AI Assistant Task Completion on GAIA Text-Only
Loading...
0.874
Accuracy
Seed-1.8
0.50064
0.59757
0.6945
0.79143
Feb 6, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Seed-1.8
Model Access Type=Clos...
2026.02
0.874
OpenAI-GPT-5-high
Model Access Type=Clos...
2026.02
0.764
Minimax-M2
Model Access Type=Clos...
2026.02
0.757
WebSailor-V2-30B-A3B
Model Access Type=Open...
2026.02
0.741
WebLeaper-30B-A3B
Model Access Type=Open...
2026.02
0.732
IterResearch-30B-A3B
Model Access Type=Open...
2026.02
0.728
GLM-4.6
Model Access Type=Clos...
2026.02
0.719
Claude-4.5-Sonnet
Model Access Type=Clos...
2026.02
0.712
Tongyi-DeepResearch 30B
Model Access Type=Open...
2026.02
0.709
MiroThinker 8B
Model Access Type=Open...
2026.02
0.664
AgentCPM-Explore-4B
Model Access Type=Open...
2026.02
0.639
DeepSeek-V3.2
Model Access Type=Clos...
2026.02
0.635
Merged-Model-4B
Model Access Type=Open...
2026.02
0.601
ASearcher-QWQ-32B v2
Model Access Type=Open...
2026.02
0.587
WebDancer-QwQ-32B
Model Access Type=Open...
2026.02
0.515
Feedback
Search any
task
Search any
task