Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multitask Language Understanding on MMLU (Per-Language Accuracy Breakdown)

89Arabic Accuracy

o1

70.165675.055379.94584.8347Dec 21, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.12
8987.3488.9292.389.3289.0488.3388.6189.788.8788.2489.5289.9285.475.38
2024.12
88.2186.228890.888.6185.7387.8288.2188.7287.8888.1588.5988.9384.7973.73
2024.12
81.5580.0783.3588.784.3782.9280.6183.4484.3582.8782.6284.2784.9377.0861.95
2024.12
79.4577.2581.885.282.1281.2278.8781.7482.2281.2980.282.4383.0370.1558.07
2024.12
70.8965.7773.058276.5974.3169.1674.5276.472.5572.0376.7777.3761.9145.83