Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Question Answering on ARC Challenge (Accuracy improvement (Δ))
Loading...
7.4
Accuracy Improvement (Δ)
Ultra-Dense Prompting
5.424
5.937
6.45
6.963
Apr 19, 2026
Accuracy Improvement (Δ)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy Improvement (Δ)
Ultra-Dense Prompting
Model=Gemini 2.0, Prom...
2026.04
7.4
Ultra-Dense Prompting
Model=Claude 3.7, Prom...
2026.04
6.8
Ultra-Dense Prompting
Model=Average across M...
2026.04
6.4
Ultra-Dense Prompting
Model=GPT-4o, Prompt S...
2026.04
6.1
Ultra-Dense Prompting
Model=GPT-4o-mini, Pro...
2026.04
5.5
Feedback
Search any
task
Search any
task