Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Generation on LiveCodeBench v6 (Pass@1)
Loading...
56.3
Pass@1
MiMo-RL 7B-R-TAP
22.604
31.352
40.1
48.848
Mar 2, 2026
Pass@1
Updated 1mo ago
Evaluation Results
Method
Method
Links
Pass@1
MiMo-RL 7B-R-TAP
Model Backbone=7B
2026.03
56.3
MiMo 7B-RL
Model Backbone=7B
2026.03
49.3
OpenAI o1-mini
Model Backbone=o1-mini
2026.03
46.8
QwQ-32B Preview
Model Backbone=QwQ-32B
2026.03
39.1
Claude 3.5 Sonnet-1022
Model Backbone=Claude...
2026.03
37.2
R1-Distill-Qwen-14B
Model Backbone=Qwen-14B
2026.03
31.9
GPT-4o 0513
Model Backbone=GPT-4o
2026.03
30.9
R1-Distill-Qwen-7B
Model Backbone=Qwen-7B
2026.03
23.9
Feedback
Search any
task
Search any
task