Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Logic Reasoning on ARC (eval)
Loading...
75
Tasks Solved
NSA (ours)
3.24
21.87
40.5
59.13
Jan 8, 2025
Tasks Solved
Updated 1d ago
Evaluation Results
Method
Method
Links
Tasks Solved
NSA (ours)
Test-time adaptation (...
2025.01
75
NSA w/o TTA
Test-time adaptation (...
2025.01
63
CodeIt [5]
2025.01
59
Mirchandani [19]
2025.01
27
Ainooson Brute Force [2]
Search strategy=Brute...
2025.01
26
Ferre [11]
2025.01
23
ARGAe
Domain Specific Langua...
2025.01
22
Ainooson MLE [2]
Search strategy=MLE
2025.01
17
ARGA [27]
Domain Specific Langua...
2025.01
9
Ferre [10]
2025.01
6
Feedback
Search any
task
Search any
task