Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Intake on H-AdminSim Primary Level 1.0
Loading...
88.9
Success Rate
Gemini 2.5 Flash
66.124
72.037
77.95
83.863
Feb 5, 2026
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
Gemini 2.5 Flash
Reasoning Option=dynam...
2026.02
88.9
GPT-5 Nano
Reasoning Option=low
2026.02
78.8
GPT-5 Mini
Reasoning Option=low
2026.02
67
Feedback
Search any
task
Search any
task