Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool-use Planning on ToolBench G3-Instruction
Loading...
0.9368
Win Rate
GPT4 TOPGUN
0.534528
0.638964
0.7434
0.847836
Feb 15, 2024
Win Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Win Rate
GPT4 TOPGUN
Reference=GPT4 ReACT
2024.02
0.9368
GPT4 TOPGUN
Reference=T.LLAMA REACT
2024.02
0.9323
GPT4 TOPGUN
Reference=ChatGPT REACT
2024.02
0.9005
GPT4 TOPGUN
Reference=T.LLAMA DFSDT
2024.02
0.8947
GPT4 TOPGUN
Reference=T.LLAMA DFSD...
2024.02
0.8782
GPT4 TOPGUN
Reference=ChatGPT DFSDT
2024.02
0.8526
GPT4 DFSDT
Reference=ChatGPT REACT
2024.02
0.84
GPT4 TOPGUN
Reference=GPT4 DFSDT
2024.02
0.7925
GPT4 ReACT
Reference=ChatGPT REACT
2024.02
0.78
T.LLAMA DFSDT+Ret
Reference=ChatGPT REACT
2024.02
0.73
T.LLAMA DFSDT
Reference=ChatGPT REACT
2024.02
0.69
ChatGPT DFSDT
Reference=ChatGPT REACT
2024.02
0.69
T.LLAMA REACT
Reference=ChatGPT REACT
2024.02
0.55
Feedback
Search any
task
Search any
task