Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Global MMLU

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multiple Choice Question AnsweringGlobal-MMLU Medical
Accuracy (ZH)89.1
17
Multi-task Language UnderstandingGlobal MMLU-Lite Māori
Accuracy54.64
10
Multilingual Multiple-Choice ReasoningGlobal MMLU 42 languages 1.0 (test)
Average Accuracy54.8
6
Multilingual General KnowledgeGlobal MMLU Lite (subset of 18 languages)
Accuracy53.73
6
Cross-lingual Reasoning and Factual KnowledgeGlobal MMLU (test)
Accuracy (RUS)23.46
2
Showing 5 of 5 rows