Agent Performance on AgentInstruct HELD-IN

2.75 HELD-IN

GPT-4

[Chart: HELD-IN score, Mar 19, 2024]

Evaluation Results

Date     HELD-IN
2024.03  2.75
2024.03  2.01
2024.03  1.96
2024.03  1.89
2024.03  1.59
2024.03  0.19