Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

HuggingGPT Evaluation Dataset

Benchmarks

Task NameDataset NameSOTA ResultTrend
Task planningHuggingGPT Evaluation Dataset graph tasks GPT-4 annotated
GPT-4 Score50.48
3
Showing 1 of 1 rows