Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Massive Multitask Language Understanding on MMLU (LLaMA 1B variant)

0.001vNMSE

DynamiQ

-0.0068410.0458170.0984750.151133Feb 9, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
0.001
2026.02
0.003
2026.02
0.013
2026.02
0.0453
2026.02
0.0904
2026.02
0.196