Share your thoughts, 1 month free Claude Pro on usSee more

Math & Code Reasoning on ARC Easy

81.1Score

Llama 3-8B

Updated 1mo ago

Evaluation Results

Method	Links
Llama 3-8B 2026.06		81.1
Llama 2-7B 2026.06		73.8
LLaDA-8B 2026.06		71.8
Falcon-7B 2026.06		70.8
Sumi-7B 2026.06		70
OLMo-7B 2026.06		68.8
DiReCT 2026.05		47
GradNorm (IS) 2026.05		44.3
InfoBatch 2026.05		43.7
Perplexity-based 2026.05		42.4
Uniform Sampling 2026.05		41.3
Loss-based 2026.05		41
DiReCT 2026.05		34
InfoBatch 2026.05		31.7
GradNorm (IS) 2026.05		30.6
Perplexity-based 2026.05		29.8
Loss-based 2026.05		28.4
Uniform Sampling 2026.05		27.2