Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Language Understanding on MMLU (T1)

48.1Accuracy

Random Rehearsal

-1.311.52524.3537.175Apr 22, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
48.1
2026.04
47.9
2026.04
47.7
2026.04
47.7
2026.04
47.5
2026.04
0.604
2026.04
0.603
2026.04
0.6
2026.04
0.6
2026.04
0.6