Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning on Hebrew Reasoning Benchmarks Suite (Copa, ARC-AI2, HellaSwag, MMLU, GSM8K, Psychometric Psi)

93.3Copa (HE)

Gemma-3-27B-IT

87.78889.21990.6592.081May 11, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2026.05
93.391.463.672.582.854.376.3
2026.05
91.98858.968.483.352.573.8
2026.05
8891.261.760.270.242.368.9