
OJBench

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Code Reasoning | OJBench | Accuracy: 10.34 | 24 |
| Code Generation | OJBench ICPC 2025 (test) | Accuracy: 19.18 | 18 |
| Competitive Coding | OJBench | Best@8 Score: 45.9 | 16 |