Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Code Generation on LiveCodeBench (Score)
Loading...
73.1
Score
Deepseek-R1
22.66
35.755
48.85
61.945
May 23, 2026
Score
Updated 8d ago
Evaluation Results
Method
Method
Links
Score
Deepseek-R1
Cost=12.327
2026.05
73.1
SFT-based Classification Router
Routing Strategy=Auto-...
2026.05
70.5
MoMA Router
Routing Strategy=Perfo...
2026.05
66.5
Qwen3-235B-A22B
Cost=14.65
2026.05
65.9
Contrastive learning based Router
Routing Strategy=Perfo...
2026.05
61.3
Qwen3-32B
Cost=14.65
2026.05
60.7
MoMA Router
Routing Strategy=Auto-...
2026.05
45.3
Contrastive learning based Router
Routing Strategy=Auto-...
2026.05
40.1
Contrastive learning based Router
Routing Strategy=Cost-...
2026.05
27.6
Deepseek-V3
Cost=9.498
2026.05
27.2
JT-Code-8B
Cost=1.667
2026.05
26.3
MoMA Router
Routing Strategy=Cost-...
2026.05
24.6
Feedback
Search any
task
Search any
task