
Multilingual Multiple-Choice Reasoning on Global MMLU 42 languages 1.0 (test)

54.8 Average Accuracy

Qwen3.5-4B

[Chart: Average Accuracy; date label Mar 12, 2026]
Updated 1mo ago

Evaluation Results

Date      Average Accuracy
2026.03   54.8
2026.03   49.3
2026.03   46.8
2026.03   45.3
2026.03   44.9
2026.03   37.2