Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Reasoning on M-HellaSwag 30 languages

49.29Macro Accuracy

Llama 3.1

48.40648.635548.86549.0945May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
49.29
2026.05
48.44