Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
AI Agent Reasoning and Tool-use on GAIA
Loading...
78.49
Level 1 Score
h2oGPTe
57.2428
62.7589
68.275
73.7911
Feb 7, 2025
Level 1 Score
Level 2 Score
Level 3 Score
Average Score
Updated 3d ago
Evaluation Results
Method
Method
Links
Level 1 Score
Level 2 Score
Level 3 Score
Average Score
h2oGPTe
2025.02
78.49
64.78
40.82
65.12
AgenticReasoning
Primary reasoning mode...
2025.02
74.36
69.21
45.46
66.13
OpenAI Deep Research
2025.02
74.29
69.06
47.6
67.36
InspectReAct
2025.02
67.92
59.3
30.77
57.58
Langfun
2025.02
58.06
51.57
24.49
49.17
Feedback
Search any
task
Search any
task