Language Understanding on HellaSwag, PIQA, MMLU Suite
[Figure: HellaSwag Accuracy over time; best entry OpenLLaMA (V1) at 49, as of Dec 28, 2023.]
Evaluation Results

| Method | Parameters | Date | HellaSwag Accuracy | PIQA Accuracy | MMLU Accuracy | Average Score |
|---|---|---|---|---|---|---|
| OpenLLaMA (V1) | 3B | 2023.12 | 49 | 75 | 26.95 | 49.9 |
| INCITE (V1) | 3B | 2023.12 | 48 | 74 | 26.75 | 48.48 |
| MobileLLaMA | 2.7B | 2023.12 | 48 | 75 | 27.3 | 49.23 |
| MobileLLaMA | 1.4B | 2023.12 | 43 | 73 | 24.97 | 44.9 |
| OPT | 1.3B | 2023.12 | 41 | 71 | 24.61 | 44.66 |
| TinyLLaMA (2T) | 1.1B | 2023.12 | 40 | 70 | 25.41 | 44.84 |
| Pythia | 1.4B | 2023.12 | 40 | 71 | 25.68 | 45.77 |
| Galactica | 1.3B | 2023.12 | 34 | 63 | 26.75 | 44.88 |
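The Average Score column appears to be the unweighted mean of the three task accuracies. A minimal sketch, assuming that interpretation (note the per-task columns look rounded, so recomputing the mean directly from the displayed values may differ slightly from the listed average):

```python
def average_score(accuracies):
    """Unweighted mean of per-task accuracies, in percent."""
    return sum(accuracies) / len(accuracies)

# Example with the displayed (rounded) OpenLLaMA (V1) values;
# the result is close to, but not exactly, the listed 49.9,
# which was presumably computed from full-precision scores.
approx = average_score([49, 75, 26.95])
```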