Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multitask Language Understanding on MMLU (Score (%), Inference Speedup)

67.6MMLU Score (%)

Full

53.5657.20560.8564.495Mar 5, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
67.6-
2026.03
66.42.12
2026.03
54.22.32
2026.03
54.1-