Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Meta-evaluation on Overall Average
Loading...
9.7
Accuracy Improvement
Ultra-Dense Prompting
7.204
7.852
8.5
9.148
Apr 19, 2026
Accuracy Improvement
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy Improvement
Ultra-Dense Prompting
Model=Gemini 2.0
2026.04
9.7
Ultra-Dense Prompting
Model=Claude 3.7
2026.04
8.6
Ultra-Dense Prompting
Model=Average across M...
2026.04
8.4
Ultra-Dense Prompting
Model=GPT-4o
2026.04
7.9
Ultra-Dense Prompting
Model=GPT-4o-mini
2026.04
7.3
Feedback
Search any
task
Search any
task