Agent on GAIA Text-Only
Top score: 79.9 (Kimi-K2.5)
[Chart: Score by date; date tick shown: Mar 26, 2026]
Updated 22d ago
Evaluation Results
Method | Details | Date | Score
Kimi-K2.5 | Number of Parameters=1... | 2026.03 | 79.9
Intern-S1-Pro | Number of Parameters=1... | 2026.03 | 77.4
Gemini-3-Pro | | 2026.03 | 75.5
GPT-5.2 | | 2026.03 | 71.1
Qwen3-VL-235B-Thinking | Number of Parameters=2... | 2026.03 | 47.8