Abstract Visual Reasoning on ARC-AGI 1
Chart: Accuracy (Pass@2), with best.human at 98 (Feb 2, 2026). Updated 4d ago.
Evaluation Results
Method              #Params   Date      Accuracy (Pass@2)
best.human          -         2026.02   98
Bespoke (Grok-4)    1.7T      2026.02   79.6
Grok-4-thinking     1.7T      2026.02   66.7
Loop-ViT (Large)    18M       2026.02   65.8
Loop-ViT (Medium)   11.2M     2026.02   63.8
VARC (ensemble)     73M       2026.02   60.4
avg.human           -         2026.02   60.2
Loop-ViT (Small)    3.8M      2026.02   60.1
VARC                18M       2026.02   54.5
TRM                 7M        2026.02   44.6
GPT-5               -         2026.02   44
HRM                 27M       2026.02   40.3
o3-mini-high        -         2026.02   34.5
Claude 3.7 8k       -         2026.02   21.2
Deepseek R1         671B      2026.02   15.8
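
The page does not define the Accuracy (Pass@2) metric. As a rough illustration only, the sketch below assumes the common ARC convention that a task counts as solved when either of the first two predicted output grids exactly matches the target grid, and that the score is the percentage of solved tasks. The function name `pass_at_2_accuracy` and the toy data are hypothetical, and the sketch assumes one test output per task, which real ARC tasks do not always satisfy.

```python
from typing import List, Sequence

Grid = List[List[int]]  # an ARC grid as rows of integer color codes


def pass_at_2_accuracy(
    attempts: Sequence[Sequence[Grid]], targets: Sequence[Grid]
) -> float:
    """Percentage of tasks solved by either of the first two attempts.

    Assumption: one target grid per task, and exact grid equality
    is the success criterion.
    """
    if len(attempts) != len(targets):
        raise ValueError("attempts and targets must have the same length")
    if not targets:
        return 0.0
    solved = sum(
        # Only the first two attempts count; any later ones are ignored.
        any(guess == target for guess in task_attempts[:2])
        for task_attempts, target in zip(attempts, targets)
    )
    return 100.0 * solved / len(targets)


# Toy data (hypothetical): four tasks, two attempts each.
targets = [[[1, 0], [0, 1]], [[2, 2]], [[3]], [[4, 4]]]
attempts = [
    [[[1, 0], [0, 1]], [[0, 0], [0, 0]]],  # first attempt matches
    [[[2, 0]], [[2, 2]]],                  # second attempt matches
    [[[0]], [[1]]],                        # neither matches
    [[[4]], [[4, 4, 4]]],                  # wrong shapes, neither matches
]

score = pass_at_2_accuracy(attempts, targets)
print(score)  # 2 of 4 tasks solved
```

Under this reading, a leaderboard entry of 60.4 would mean roughly 60% of evaluation tasks had a correct grid among the two submitted attempts.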