Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning on ARC-AGI public evaluation set V2

97.9Accuracy

Confluence Lab

Updated 9d ago

Evaluation Results

Method	Links
Confluence Lab 2026.04		97.9	11.77	-
SQUEEZE EVOLVE 2026.04		97.5	7.74	3.7
SQUEEZE EVOLVE 2026.04		97.5	5.93	4.9
Imbue 2026.04		95.1	8.71	-
SQUEEZE EVOLVE 2026.04		94.2	5.62	5.1
RSA 2026.04		93.3	28.85	1