Coding on LiveCodeBench 24.08-25.02
Top result: DeepSeek-R1 (Pass@1: 65.9)
[Chart: Pass@1 and Average Output Token Length over time; date shown: Mar 6, 2025]
Updated 1mo ago
Evaluation Results
Method                          Method details              Links     Pass@1   Average Output Token Length
DeepSeek-R1                     -                           2025.03   65.9     10,400
TinyR1-32B-Preview              Parameters=32B              2025.03   61.6     12,400
DeepSeek-R1-Distill-Llama-70B   Parameters=70B, Backbo...   2025.03   57.5     -
DeepSeek-R1-Distill-Qwen-32B    Parameters=32B, Backbo...   2025.03   57.2     10,100