Tool Learning on RestBench Spotify
[Chart: leaderboard results by date; metrics: Success, Path. Top point: ToolCoder, Success 87.72, Feb 17, 2025.]
Updated 4d ago
Evaluation Results
Method                              Configuration             Links     Success  Path
ToolCoder                           Backbone=gpt-4o-mini,...  2025.02   87.72    78.95
ToolCoder w/o Reusable Repository   Backbone=gpt-4o-mini,...  2025.02   78.95    71.93
ToolCoder w/o Error Reflection      Backbone=gpt-4o-mini,...  2025.02   73.68    70.18
CodeAct                             Backbone=gpt-4o-mini,...  2025.02   71.93    66.67
Chameleon                           Backbone=gpt-4o-mini,...  2025.02   70.18    63.16
ReAct                               Backbone=gpt-4o-mini,...  2025.02   68.42    52.63
ATC                                 Backbone=gpt-4o-mini,...  2025.02   65.47    68.42
ConAgents                           Backbone=gpt-4o-mini,...  2025.02   64.92    68.42
EasyTool                            Backbone=gpt-4o-mini,...  2025.02   62.19    64.92
RestGPT                             Backbone=gpt-4o-mini,...  2025.02   61.41    57.89