Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Abstract Reasoning on ARC-AGI v1 (test)
Loading...
98
Accuracy
Best Human
12.512
34.706
56.9
79.094
May 19, 2026
Accuracy
Updated 14d ago
Evaluation Results
Method
Method
Links
Accuracy
Best Human
2026.05
98
Gemini 3 Pro
#Params=N/A
2026.05
75
Grok-4-thinking
#Params=1.7T
2026.05
66.7
Avg. Human
2026.05
60.2
GPT 5.2 (low)
#Params=N/A
2026.05
55.7
GRAM
#Params=10M, Supervisi...
2026.05
52
TRM
#Params=7M, Supervisio...
2026.05
44.6
HRM
#Params=27M, Supervisi...
2026.05
40.3
o3-mini-high
#Params=N/A
2026.05
34.5
Claude 3.7 16k
#Params=N/A
2026.05
28.6
Direct Pred
#Params=27M, Supervisi...
2026.05
21
Deepseek-R1
#Params=671B
2026.05
15.8
Feedback
Search any
task
Search any
task