
Error Detection on CoSPlan Maze-E

Best accuracy: 0.403 (GPT-4o)

Dec 11, 2025
Updated 3d ago

Evaluation Results

Date      Accuracy   Method   Links
2025.12   0.403
2025.12   0.353
2025.12   0.334
2025.12   0.331
2025.12   0.328
2025.12   0.261
2025.12   0.21
2025.12   0.208
2025.12   0.207
2025.12   0.205
2025.12   0.205
2025.12   0.191
2025.12   0.133
2025.12   0.084
2025.12   0.064