
Task Planning on HuggingGPT Human Evaluation Set 130 diverse requests (test)

Passing Rate: 0.9122

HuggingGPT

[Chart: Passing Rate over time; x-axis date Mar 30, 2023]
Updated 4d ago

Evaluation Results

Date      Passing Rate   (unlabeled)
2023.03   0.9122         0.7847
2023.03   0.7941         0.5841
2023.03   0.5104         0.3217