MMMLU

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| General Knowledge Evaluation | MMMLU | MMMLU General Knowledge Accuracy: 82.25 | 29 |
| Multilingual Language Understanding | MMMLU (Massive Multilingual Language Understanding) | Accuracy: 79.5 | 21 |
| Multilingual Language Understanding | MMMLU | Accuracy (Korean): 60.5 | 20 |
| Multilingual Knowledge | MMMLU | Accuracy: 87.2 | 18 |
| Multitask Language Understanding | MMMLU Swahili 1.0 (test) | Accuracy: 33.38 | 18 |
| Multitask Language Understanding | MMMLU Korean 1.0 (test) | Accuracy: 41.94 | 18 |
| Multitask Language Understanding | MMMLU non-EU languages (test) | Accuracy: 77.4 | 16 |
| Multitask Language Understanding | MMMLU 24 official EU languages | Overall Score: 80.6 | 14 |
| Chinese Language Understanding | MMMLU | MMMLU Score: 37.08 | 8 |
| Question Answering | MMMLU | Accuracy: 36.14 | 8 |
| Multilinguality | MMMLU ko, de, es, ja | Average Score: 86.3 | 4 |
| Multilingual Language Understanding | MMMLU 5-shot | Accuracy: 78.94 | 3 |