Coding Reasoning on LiveCodeBench
[Chart: Medium Pass@1. Top entry: BeamSearch-IS, 73.5 (May 13, 2026). Available metrics: Medium Pass@1, Medium Pass@8, Hard Pass@1, Hard Pass@8, Overall Pass@1, Overall Pass@8.]
Updated 20d ago
Evaluation Results
Method            Tag                        Date     Medium Pass@1  Medium Pass@8  Hard Pass@1  Hard Pass@8  Overall Pass@1  Overall Pass@8
BeamSearch-IS     Training Paradigm=Cont...  2026.05  73.5           89             33.9         57.2         52.5            70.2
BoN               Training Paradigm=Cont...  2026.05  72.5           83.6           28.7         50.3         49.2            65.9
Seq-IS            Training Paradigm=Cont...  2026.05  71.6           86.2           29.6         52.1         49.3            68.1
Gemini-2.5-Flash  -                          2026.05  71.5           83.8           30           49.6         49.4            65.6
BeamSearch        Training Paradigm=Cont...  2026.05  69.9           84.2           31.2         52           49.3            67.1
Seq               Training Paradigm=Cont...  2026.05  68.9           85.6           31.5         51.6         49             67.5