Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Conditional abstraction and aggregation on AirDialog
Loading...
0.04
Cost
UQE-claude-3-haiku
0.0268
0.1159
0.205
0.2941
Jun 23, 2024
Cost
EMD
Updated 4d ago
Evaluation Results
Method
Method
Links
Cost
EMD
UQE-claude-3-haiku
Backbone=claude-3-haiku
2024.06
0.04
-
lc-gpt-4-turbo
Backbone=gpt-4-turbo
2024.06
0.21
-
lc-claude-3-opus
Backbone=claude-3-opus
2024.06
0.37
-
Feedback
Search any
task
Search any
task