Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Reasoning on LiveCodeBench v6 (Acc avg@32)
Loading...
73.8
Accuracy avg@32
IOP-GSPO
60.072
63.636
67.2
70.764
Apr 19, 2026
Accuracy avg@32
Updated 26d ago
Evaluation Results
Method
Method
Links
Accuracy avg@32
IOP-GSPO
Model Architecture=Qwe...
2026.04
73.8
GSPO
Model Architecture=Qwe...
2026.04
69.6
Base
Model Architecture=Qwe...
2026.04
68.7
IOP-GSPO
Model Architecture=Qwe...
2026.04
67.8
GSPO
Model Architecture=Qwe...
2026.04
61.8
Base
Model Architecture=Qwe...
2026.04
60.6
Feedback
Search any
task
Search any
task