Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Code Understanding and Reasoning on LiveCodeBench v6
Loading...
42.5
Accuracy (avg@1)
DeepSeek-R1-Distill-Llama-8B
9.22
17.86
26.5
35.14
Dec 18, 2025
Accuracy (avg@1)
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy (avg@1)
DeepSeek-R1-Distill-Llama-8B
Architecture=Dense, #...
2025.12
42.5
Sigma-MoE-Tiny
Architecture=MoE, # Ac...
2025.12
42.2
DeepSeek-R1-Distill-Qwen-7B
Architecture=Dense, #...
2025.12
35.7
Qwen3-1.7B
Architecture=Dense, #...
2025.12
33.2
Phi-3.5-MoE
Architecture=MoE, # Ac...
2025.12
10.5
Feedback
Search any
task
Search any
task