Multitask Language Understanding on MMLU (Accuracy and Performance Gain)

Accuracy: 73.5

Qwen3 8B Base

[Chart: accuracy, y-axis ticks 39.804, 48.552, 57.3, 66.048; x-axis Jan 30, 2026]
Updated 4d ago

Evaluation Results

Date     Accuracy  Gain  Method  Links
2026.01  73.5      -2.5
2026.01  73.5      -2.5
2026.01  63.9      -0.8
2026.01  63.6      -1.1
2026.01  62.6      -1.3
2026.01  62.3      -1.6
2026.01  58.7      -1.3
2026.01  58.5      -1.5
2026.01  43.8      1.5
2026.01  42        -1
2026.01  41.4      -1.6
2026.01  41.1      -1.1