Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multilingual Multiple-Choice Reasoning on Global PIQA 116 languages 1.0 (test)

79.31Accuracy

Qwen3.5-4B

62.555666.905371.25575.6047Mar 12, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
79.31
2026.03
74.6
2026.03
70.8
2026.03
70.7
2026.03
68.3
2026.03
63.2