Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Massive Multitask Language Understanding on MMLU (Performance Profile)

56.6MMLU

Qwen3-4B + FBS-Full (ours)

54.62455.13755.6556.163Jan 29, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
56.65320.736
2026.01
56.47551.030
2026.01
55.176010
2026.01
555550.7430
2026.01
555950.8215
2026.01
54.96460.922
2026.01
54.75700.818