Share your thoughts, 1 month free Claude Pro on usSee more

Code Understanding and Reasoning on LiveCodeBench v6

42.5Accuracy (avg@1)

DeepSeek-R1-Distill-Llama-8B

Updated 3mo ago

Evaluation Results

Method	Links
DeepSeek-R1-Distill-Llama-8B 2025.12		42.5
Sigma-MoE-Tiny 2025.12		42.2
DeepSeek-R1-Distill-Qwen-7B 2025.12		35.7
Qwen3-1.7B 2025.12		33.2
Phi-3.5-MoE 2025.12		10.5