Scheduling on H-AdminSim Tertiary Level 1.0
Best Success Rate: 99.6 (Gemini 2.5 Flash)
[Chart: Success Rate; labeled date Feb 5, 2026]
Updated 1mo ago
Evaluation Results
Method            Settings                   Date     Success Rate
Gemini 2.5 Flash  Reasoning Option=dynam...  2026.02  99.6
GPT-5 Mini        Reasoning Option=low,...   2026.02  97
GPT-5 Nano        Reasoning Option=low,...   2026.02  93.3
GPT-5 Mini        Reasoning Option=low,...   2026.02  75
Gemini 2.5 Flash  Reasoning Option=dynam...  2026.02  37.9
GPT-5 Nano        Reasoning Option=low,...   2026.02  17.4