Conditional aggregation on Clevr (test)
[Chart: Runtime (seconds). Plotted entry: UQE-claude-3-haiku, 3.13, Jun 23, 2024]
Evaluation Results
Method              Model           Date     Runtime (seconds)
UQE-claude-3-haiku  Claude 3 Haiku  2024.06  3.13
lc-gpt-4-turbo      GPT-4 Turbo     2024.06  28.06