Strategy Prediction on ESC (test)

0.67 Error Rate Metric (EMR)

GPT-5

-0.64448.227817.125.9722 Apr 20, 2026
Updated 1mo ago

Evaluation Results

Method  Links
2026.04   0.67  12.26  32.04
2026.04  12.85  18.61  14.07
2026.04  15.15  19.77  13.69
2026.04  23.61  28.63  12.5
2026.04  24.99  30.15  12.66
2026.04  25.21  28.28  12.7
2026.04  29.55  34.3   12.34
2026.04  29.72  34.92  12.37
2026.04  29.97  36.22  12.43
2026.04  33.53  37.97  12.13