Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Reasoning on LiveCodeBench v6 (Acc avg@32)

73.8Accuracy avg@32

IOP-GSPO

60.07263.63667.270.764Apr 19, 2026
Updated 26d ago

Evaluation Results

MethodLinks
2026.04
73.8
2026.04
69.6
2026.04
68.7
2026.04
67.8
2026.04
61.8
2026.04
60.6