Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multilingual Reasoning on Polymath Low
Loading...
96.5
Accuracy (en)
Base
95.98
96.115
96.25
96.385
Oct 31, 2025
Accuracy (en)
Accuracy (de)
Accuracy (es)
Accuracy (ar)
Accuracy (ja)
Accuracy (ko)
Accuracy (th)
Accuracy (bn)
Accuracy (sw)
Accuracy (te)
Average Accuracy
Translator Usage Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy (en)
Accuracy (de)
Accuracy (es)
Accuracy (ar)
Accuracy (ja)
Accuracy (ko)
Accuracy (th)
Accuracy (bn)
Accuracy (sw)
Accuracy (te)
Average Accuracy
Translator Usage Rate
Base
Model=Qwen3-4B
2025.10
96.5
88
93.9
89.6
85.3
90.7
85.1
83.2
29.3
69.9
81.1
0
Selective translation
Model=Qwen3-4B
2025.10
96.3
88.3
94.4
90.4
86.1
91.5
88.3
86.7
81.3
77.1
88
19.3
Full translation
Model=Qwen3-4B
2025.10
96
88.3
93.3
90.9
87.5
92.5
89.6
90.4
85.3
80.5
89.4
100
Feedback
Search any
task
Search any
task