Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Commonsense Reasoning on ARC Challenge (val)
Loading...
91.7
Accuracy
SKILL-MOE
88.476
89.313
90.15
90.987
Mar 7, 2025
Accuracy
Updated 1d ago
Evaluation Results
Method
Method
Links
Accuracy
SKILL-MOE
2025.03
91.7
Mixture-of-Agents
2025.03
90.1
Self-Consistency
Aggregation=Best ×5
2025.03
89.3
ReConcile
2025.03
89
Task-Best Model
2025.03
88.6
Feedback
Search any
task
Search any
task