Tool Learning on API-Bank LV2
Top result: ToolCoder, Correctness 62.41 (Feb 17, 2025)
Updated 4d ago
Evaluation Results
| Method | Details | Date | Correctness |
|---|---|---|---|
| ToolCoder | Backbone=gpt-4o-mini,... | 2025.02 | 62.41 |
| ToolCoder w/o Reusable Repository | Backbone=gpt-4o-mini,... | 2025.02 | 62.41 |
| EasyTool | Backbone=gpt-4o-mini,... | 2025.02 | 58.24 |
| ToolCoder w/o Error Reflection | Backbone=gpt-4o-mini,... | 2025.02 | 58.02 |
| ReAct | Backbone=gpt-4o-mini,... | 2025.02 | 56.3 |
| CodeAct | Backbone=gpt-4o-mini,... | 2025.02 | 54.07 |
| ATC | Backbone=gpt-4o-mini,... | 2025.02 | 52.18 |
| Chameleon | Backbone=gpt-4o-mini,... | 2025.02 | 37.04 |
| ConAgents | Backbone=gpt-4o-mini,... | 2025.02 | 36.24 |
| RestGPT | Backbone=gpt-4o-mini,... | 2025.02 | 34.83 |