Intake on H-AdminSim Primary Level 1.0
[Chart: Success Rate. Top result: 88.9, Gemini 2.5 Flash (Feb 5, 2026)]
Updated 1mo ago
Evaluation Results
Method            Configuration              Date     Success Rate
Gemini 2.5 Flash  Reasoning Option=dynam...  2026.02  88.9
GPT-5 Nano        Reasoning Option=low       2026.02  78.8
GPT-5 Mini        Reasoning Option=low       2026.02  67