Maze Navigation on Maze Hard

97.66 Accuracy

GPT-5

[Chart: accuracy over time, Nov 28, 2025 to May 19, 2026]
Updated 14d ago

Evaluation Results

Date      Accuracy
2025.11   97.66
2025.11   93.36
2026.05   86.73
2026.05   85.3
2026.05   83.8
2025.11   78.52
2026.05   74.5
2025.11   68.36
2025.11   63.28
2025.11   50.59
2025.11   36.52
2025.11   28.52
2025.11   26.51
2025.11   17.76
2025.11   1.56
2025.11   0.39
2025.11   0.2
2025.11   0