Share your thoughts, 1 month free Claude Pro on usSee more

General Language Understanding on LLM Evaluation Suite (PiQA, ARC, HellaSwag, WinoGrande, MMLU v1)

74.6Overall Accuracy

LLaMA-3-8B-Lizard

Updated 3mo ago

Evaluation Results

Method	Links
LLaMA-3-8B-Lizard 2025.07		74.6	72.4
Mistral-7B-LoLCATs 2025.07		74.5	70.7
Mistral-7B-Lizard 2025.07		74.5	72.2
Mistral-7B 2025.07		74.4	72.4
LLaMA-3-8B-LoLCATs 2025.07		74.2	70.7
Gemma-7B 2025.07		74.1	72.3
LLaMA-3-8B 2025.07		73.1	72
Liger-GLA-Llama-3-8B 2025.07		72.4	67.6
Griffin-7B 2025.07		71.1	65.8
Mamba-7B 2025.07		71	64.7
Liger-GLA-Mistral-7B 2025.07		70.9	65.1
Mistral-7B-SUPRA 2025.07		69.9	64
Hawk-7B 2025.07		69.6	63.8
RWKV-6-v2.1-7B 2025.07		69.4	69.4
TransNormerLLM-7B 2025.07		68.2	64.1
Mamba2-LLaMA-3-8B 2025.07		65.6	61.9