Natural Language Understanding on ARC Challenge
[Chart: Accuracy over time, Oct 2024 – Feb 2026. Current state of the art: LLaMA-3.1-405B Base, 95.3 accuracy. Updated 4d ago.]
Evaluation Results

| Method              | Settings                   | Date    | Accuracy |
|---------------------|----------------------------|---------|----------|
| LLaMA-3.1-405B Base | #Shots=25-shot, Archit...  | 2026.01 | 95.3     |
| DeepSeek-V3-Base    | #Shots=25-shot, Archit...  | 2026.01 | 95.3     |
| Yuan3.0-1T Base     | #Shots=25-shot, Archit...  | 2026.01 | 94.3     |
| Full-Attn           | #Shots=25-shot, Model...   | 2026.02 | 78.4     |
| HySparse            | #Shots=25-shot, Model...   | 2026.02 | 77.6     |
| HySparse            | #Shots=25-shot, Model...   | 2026.02 | 75.0     |
| Hybrid SWA          | #Shots=25-shot, Model...   | 2026.02 | 74.9     |
| Full-Attn           | #Shots=25-shot, Model...   | 2026.02 | 70.2     |
| Hybrid SWA          | #Shots=25-shot, Model...   | 2026.02 | 63.9     |
| Arcana              | zero-shot=true             | 2024.10 | 61.4     |
| Vicuna-v1.5         | zero-shot=true             | 2024.10 | 56.6     |
| LLaMA-2-Chat        | zero-shot=true             | 2024.10 | 54.9     |
| WizardLM            | zero-shot=true             | 2024.10 | 47.5     |
| LLaMA-2             | zero-shot=true             | 2024.10 | 40.3     |
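For reference, the Accuracy column reports the percentage of ARC Challenge multiple-choice questions a model answers correctly. A minimal sketch of that metric, with illustrative names and data (not the actual evaluation harness):

```python
def accuracy(predictions, answer_keys):
    """Percent of items where the predicted choice matches the gold key."""
    correct = sum(p == a for p, a in zip(predictions, answer_keys))
    return 100.0 * correct / len(answer_keys)

# Hypothetical predictions vs. gold keys for four questions.
preds = ["B", "C", "A", "D"]
keys = ["B", "C", "D", "D"]
print(accuracy(preds, keys))  # → 75.0
```

Few-shot settings such as "25-shot" prepend 25 solved examples to each prompt before the model picks an option; the accuracy computation itself is unchanged.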