Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on ARC Challenge v1 (test)
Loading...
56.7
Normalized Accuracy
LLaMA-3-8B-Lizard
43.908
47.229
50.55
53.871
Jul 11, 2025
Normalized Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Normalized Accuracy
LLaMA-3-8B-Lizard
Training Tokens (B)=0....
2025.07
56.7
Mistral-7B-Lizard
Training Tokens (B)=0....
2025.07
55.8
Mistral-7B-LoLCATs
Training Tokens (B)=0....
2025.07
54.9
LLaMA-3-8B-LoLCATs
Training Tokens (B)=0....
2025.07
54.9
Mistral-7B
Training Tokens (B)=80...
2025.07
53.8
LLaMA-3-8B
Training Tokens (B)=15...
2025.07
53.3
Gemma-7B
Training Tokens (B)=60...
2025.07
53.2
Liger-GLA-Llama-3-8B
Training Tokens (B)=0....
2025.07
52.5
Liger-GLA-Mistral-7B
Training Tokens (B)=0....
2025.07
49.3
Mamba2-LLaMA-3-8B
Training Tokens (B)=20...
2025.07
48
Griffin-7B
Training Tokens (B)=30...
2025.07
47.9
Mamba-7B
Training Tokens (B)=12...
2025.07
46.7
RWKV-6-v2.1-7B
Training Tokens (B)=14...
2025.07
46.3
Hawk-7B
Training Tokens (B)=30...
2025.07
45.9
Mistral-7B-SUPRA
Training Tokens (B)=10...
2025.07
45.8
TransNormerLLM-7B
Training Tokens (B)=14...
2025.07
44.4
Feedback
Search any
task
Search any
task