Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Agent Performance on AgentInstruct HELD-IN

2.75HELD-IN

GPT-4

0.08760.77881.472.1612Mar 19, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.03
2.75
2024.03
2.01
2024.03
1.96
2024.03
1.89
2024.03
1.59
2024.03
0.19