Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool Planning on Black-Box Tool Planning (test)
Loading...
70.58
Success Rate
TOPGUN
42.6768
49.9209
57.165
64.4091
Feb 15, 2024
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
TOPGUN
Base Model=GPT-4, Sett...
2024.02
70.58
DFSDT
Base Model=GPT-4, Sett...
2024.02
61.45
ReAct
Base Model=GPT-4, Sett...
2024.02
45.45
ReverseChain
Base Model=GPT-4, Sett...
2024.02
43.75
Feedback
Search any
task
Search any
task