Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multilingual Multiple-Choice Reasoning on Global PIQA 116 languages 1.0 (test)
Loading...
79.31
Accuracy
Qwen3.5-4B
62.5556
66.9053
71.255
75.6047
Mar 12, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen3.5-4B
decoding=greedy, param...
2026.03
79.31
Qwen3-4B
decoding=greedy, param...
2026.03
74.6
Gemma3-4B
decoding=greedy, param...
2026.03
70.8
Ministral-3-3B
decoding=greedy, param...
2026.03
70.7
Tiny Aya Global
decoding=greedy
2026.03
68.3
SmolLM3-3B
decoding=greedy, param...
2026.03
63.2
Feedback
Search any
task
Search any
task