Abstraction and Reasoning on ARC-AGI 2 (public evaluation)

Top result: 100 Pass@2 (Human Panel)

Updated 3mo ago

Evaluation Results

Method	Links	Pass@2
Human Panel 2026.04		100
CoreThink Meta-Classifier 2026.04		30.8
J. Berman 2026.04		29.4
NVARC 2026.04		27.6
Compositional Reasoner 2026.04		24.4
GPT-5-Pro 2026.04		18.3
Grok-4 (Thinking) 2026.04		16
Claude Opus 4 (16K) 2026.04		8.6
o3 (High) 2026.04		6.5
o4-mini (High) 2026.04		6.1
Claude Sonnet 4 (16K) 2026.04		5.9
o3-Pro (High) 2026.04		4.9
Gemini 2.5 Pro (32K) 2026.04		4.9