Share your thoughts, 1 month free Claude Pro on usSee more

Multi-task Language Understanding on MMLU v1 (test)

66.6Accuracy

LLaMA-3-8B

Updated 3mo ago

Evaluation Results

Method	Links
LLaMA-3-8B 2025.07		66.6
Gemma-7B 2025.07		62.9
Mistral-7B 2025.07		62.4
LLaMA-3-8B-Lizard 2025.07		61.2
Mistral-7B-Lizard 2025.07		60.8
LLaMA-3-8B-LoLCATs 2025.07		52.8
Mistral-7B-LoLCATs 2025.07		51.4
Liger-GLA-Llama-3-8B 2025.07		43.4
Mamba2-LLaMA-3-8B 2025.07		43.2
TransNormerLLM-7B 2025.07		43.1
Griffin-7B 2025.07		39.3
Liger-GLA-Mistral-7B 2025.07		36.3
Hawk-7B 2025.07		35
Mistral-7B-SUPRA 2025.07		34.2
Mamba-7B 2025.07		33.3