Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multilingual Language Understanding on MMLU ProX-Lite
Loading...
78.2
Accuracy (en)
Full translation
76.64
77.045
77.45
77.855
Oct 31, 2025
Accuracy (en)
Accuracy (de)
Accuracy (es)
Accuracy (ar)
Accuracy (ja)
Accuracy (ko)
Accuracy (th)
Accuracy (bn)
Accuracy (sw)
Accuracy (te)
Average Accuracy
Translator Usage (%)
Updated 5d ago
Evaluation Results
Method
Method
Links
Accuracy (en)
Accuracy (de)
Accuracy (es)
Accuracy (ar)
Accuracy (ja)
Accuracy (ko)
Accuracy (th)
Accuracy (bn)
Accuracy (sw)
Accuracy (te)
Average Accuracy
Translator Usage (%)
Full translation
Model=gpt-oss-20b
2025.10
78.2
77.7
78.6
77.8
76
76.3
75.7
76.1
74.4
76.9
76.8
100
Base
Model=gpt-oss-20b
2025.10
77.4
77.8
77.3
74.1
77.4
73.9
74.6
75.6
66.9
77
75.2
0
Selective translation
Model=gpt-oss-20b
2025.10
76.7
77
77.4
74.6
76
74.1
75
75.5
68.9
76.1
75.1
7.2
Feedback
Search any
task
Search any
task