
Task Planning Evaluation Dataset

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Task Planning | Task Planning Evaluation Dataset sequential tasks GPT-4 annotated (test) | Edit Distance: 0.54 | 3