Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CodeContests

Benchmarks

Task NameDataset NameSOTA ResultTrend
Code GenerationCodeContests (test)
Pass@11,200
48
Code GenerationCodeContests
Pass@158.79
38
Code GenerationCodeContests
Avg@839.33
26
Code GenerationCodeContests
Accuracy (CC)26.7
15
Code GenerationCodeContests (evaluation set)
Pass@119.7
8
Program SynthesisCodeContests (test)
Pass@10.2045
6
Coding ReasoningCodecontests
Pass Rate65.8
5
Competition-Level Code GenerationCodeContests (val)
10@1k21
3
Showing 8 of 8 rows