| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multiple Choice Question Answering | Global-MMLU Medical | Accuracy (ZH)89.1 | 17 | |
| Multi-task Language Understanding | Global MMLU-Lite Māori | Accuracy54.64 | 10 | |
| Multilingual General Knowledge | Global MMLU Lite (subset of 18 languages) | Accuracy53.73 | 6 | |
| Cross-lingual Reasoning and Factual Knowledge | Global MMLU (test) | Accuracy (RUS)23.46 | 2 |