Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Capability on MMLU-Pro OpenR1-Math Harder

71.3Accuracy

Qwen-4B

70.46870.68470.971.116Feb 11, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
71.3
2026.02
71.2
2026.02
70.5