
Abstraction and Reasoning on ARC-AGI Public Training Set (Easy) (60 tasks)

Total Cost: 0.41

Two-step agent

Updated 4mo ago

Evaluation Results

Method	Date	Total Cost	Links
Two-step agent	2025.12	0.41	-
ADAS best agent (reproduced)	2025.12	2.11	-
Two-step agent	2025.12	2.85	-
ENCOMPASS (+ global best-of-N, N = 8)	2025.12	3.29	-
ENCOMPASS (+ global best-of-N, N = 36)	2025.12	14.81	-
ENCOMPASS (+ BFS)	2025.12	15.81	-
ENCOMPASS (+ global best-of-N, N = 8)	2025.12	22.76	-
ADAS best agent (reproduced)	2025.12	27.85	-
ENCOMPASS (+ BFS)	2025.12	88.69	-
ENCOMPASS (+ global best-of-N, N = 36)	2025.12	95.98	-