Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Task Planning on HuggingGPT Evaluation Dataset graph tasks GPT-4 annotated
Loading...
50.48
GPT-4 Score
GPT-3.5
11.6464
21.7282
31.81
41.8918
Mar 30, 2023
GPT-4 Score
Precision
Recall
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
GPT-4 Score
Precision
Recall
F1 Score
GPT-3.5
LLM=GPT-3.5
2023.03
50.48
54.9
49.23
51.91
Vicuna-7b
LLM=Vicuna-7b
2023.03
19.17
13.97
28.08
18.66
Alpaca-7b
LLM=Alpaca-7b
2023.03
13.14
16.18
28.33
20.59
Feedback
Search any
task
Search any
task