
Scientific Olympiad Reasoning on FrontierScience-Olympiad

43.5 Biology Accuracy

GPT-5.2

[Chart: Biology Accuracy over time, Feb 10, 2026 to May 13, 2026]
Updated 20d ago

Evaluation Results

Date      Biology Accuracy   (unlabeled)   (unlabeled)   (unlabeled)
2026.02   43.5               89            74.3          77.1
2026.02   41.5               71.5          57.8          61.7
2026.02   41                 85.6          75.5          76.1
2026.02   38.8               63.4          61.5          60
2026.02   35.5               81.5          67            69.7
2026.02   33.5               82.4          67.4          70
2026.02   33                 73.2          67.2          66.2
2026.02   30                 71            68            65.4
-         30                 71.3          65.5          64.3
2026.02   30                 76.1          59            62.9
2026.05   30                 56.3          56            53.5
2026.05   27.5               57.5          57.5          54.5
2026.02   26.3               77.2          67.3          67.1
2026.02   26.3               74.1          67.3          65.9
2026.02   26.3               61.9          57.8          56.3
2026.02   26.3               58.1          57.3          54.5
2026.05   25                 74.4          65.5          65
2026.05   25                 69.4          62.5          61.5
2026.02   24                 81.8          72.5          71.4
-         22.5               67.2          65.8          62
2026.02   20                 76.6          65            65.1
2026.02   20                 70.6          69.5          65
-         20                 58.8          54            52.5
2026.02   20                 50.9          40.3          42.5
2026.02   18.8               49.4          43.5          43.4
2026.05   17.5               60            54.5          53
2026.05   17.5               61.9          69            61
2026.02   15                 61.9          56.3          54.4
2026.02   10                 47.8          45.3          42.8
2026.02   3                  12.4          14.1          12.3