Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Expert-Level Reasoning on GAIA text-only (val)
Loading...
81.6
Inference Accuracy
ReThinker
48.736
57.268
65.8
74.332
Feb 4, 2026
Inference Accuracy
Updated 3mo ago
Evaluation Results
Method
Method
Links
Inference Accuracy
ReThinker
Model Category=Inferen...
2026.02
81.6
Gemini-3-Pro
Model Category=Foundat...
2026.02
79
GPT-5-high
Model Category=Foundat...
2026.02
76.4
MiroThinker-v1.0
Model Category=Inferen...
2026.02
73.5
ReThinker
Model Category=Inferen...
2026.02
72.8
GLM-4.6
Model Category=Foundat...
2026.02
71.9
Claude-4.5-Sonnet
Model Category=Foundat...
2026.02
71.2
Tongyi DeepResearch
Model Category=Inferen...
2026.02
70.9
OpenAI DeepResearch
Model Category=Inferen...
2026.02
67.4
DeepSeek-V3.2
Model Category=Foundat...
2026.02
63.5
Kimi K2
Model Category=Foundat...
2026.02
57.7
WebExplorer
Model Category=Inferen...
2026.02
50
Feedback
Search any
task
Search any
task