Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LiveCodeBench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Code GenerationLiveCodeBench
Pass@190.7
89
Code GenerationLiveCodeBench
Pass@195.8
86
Code GenerationLiveCodeBench
Average Score168
68
Code ReasoningLiveCodeBench
Accuracy87.4
62
Code GenerationLiveCodeBench
Accuracy67
60
Code GenerationLiveCodeBench v6
Accuracy100
58
Code GenerationLiveCodeBench
Pass@11,784
51
Code GenerationLiveCodeBench
Pass@188.1
48
Code GenerationLiveCodeBench v6
Score91.7
41
Code GenerationLiveCodeBench (test)
Pass@1 Overall53.6
38
CodeLiveCodeBench V5-6
Accuracy50.8
33
CodeLiveCodeBench V1-4
Accuracy47.1
33
Competitive ProgrammingLiveCodeBench Pro 25Q2
Easy Score94.8
33
Competitive ProgrammingLiveCodeBench Pro 25Q1
Easy Score96.6
33
Code VerificationLiveCodeBench
Pass@139.31
32
Code GenerationLiveCodeBench
Accuracy79.5
30
CodingLiveCodeBench v5
Accuracy77.6
29
ReasoningLiveCodeBench
LiveCodeBench Score54.25
27
Code GenerationLiveCodeBench v3
Score90.2
26
Code GenerationLiveCodeBench
Speedup3.52
24
Code generationLiveCodeBench Jan-Apr 2025
Accuracy (pass@1)47.25
24
Code GenerationLiveCodeBench Medium
Accuracy96.79
23
CodingLiveCodeBench
Task Accuracy79
23
Competitive ProgrammingLiveCodeBench v5
Score82.8
22
Code GenerationLiveCodeBench (LCB)
FUNC Score73.1
21
Showing 25 of 128 rows