Share your thoughts, 1 month free Claude Pro on usSee more

Question Answering on ARC Challenge v1 (test)

56.7Normalized Accuracy

LLaMA-3-8B-Lizard

Updated 3mo ago

Evaluation Results

Method	Links
LLaMA-3-8B-Lizard 2025.07		56.7
Mistral-7B-Lizard 2025.07		55.8
Mistral-7B-LoLCATs 2025.07		54.9
LLaMA-3-8B-LoLCATs 2025.07		54.9
Mistral-7B 2025.07		53.8
LLaMA-3-8B 2025.07		53.3
Gemma-7B 2025.07		53.2
Liger-GLA-Llama-3-8B 2025.07		52.5
Liger-GLA-Mistral-7B 2025.07		49.3
Mamba2-LLaMA-3-8B 2025.07		48
Griffin-7B 2025.07		47.9
Mamba-7B 2025.07		46.7
RWKV-6-v2.1-7B 2025.07		46.3
Hawk-7B 2025.07		45.9
Mistral-7B-SUPRA 2025.07		45.8
TransNormerLLM-7B 2025.07		44.4