Language Understanding on PiQA, ARC, HellaSwag, WinoGrande, MMLU

Aggregate Accuracy: 75.2

LLaMA-3-8B-Lizard

[Chart omitted; date shown: Jul 11, 2025]
Updated 1mo ago

Evaluation Results

Method    Links
2025.07   75.2   73.5
2025.07   73.9   71
2025.07   73.1   72
2025.07   71.8   69.5
2025.07   67.8   60.8