Share your thoughts, 1 month free Claude Pro on usSee more

General Capability on MMLU-Pro OpenR1-Math Harder

71.3Accuracy

Qwen-4B

Updated 5mo ago

Evaluation Results

Method	Links
Qwen-4B 2026.02		71.3
RePO 2026.02		71.2
LUFFY 2026.02		70.5