Tool Use on ACEBench Single
[Chart: Accuracy over time. Top score: 90 (Base), Apr 13, 2026. Updated 4d ago.]
Evaluation Results
| Method | Model | Date | Accuracy |
|---|---|---|---|
| Base | ToolACE-2.5 | 2026.04 | 90 |
| CAA | ToolACE-2.5 | 2026.04 | 89 |
| Prompt | ToolACE-2.5 | 2026.04 | 87 |
| Base | Watt-Tool | 2026.04 | 84 |
| Prompt | Watt-Tool | 2026.04 | 83 |
| CAA | Watt-Tool | 2026.04 | 83 |
| Prompt | Qwen3-8B | 2026.04 | 72 |
| Base | Qwen3-8B | 2026.04 | 71 |
| Prompt | Qwen3-14B | 2026.04 | 70 |
| Base | Qwen3-14B | 2026.04 | 68 |
| CAA | Qwen3-14B | 2026.04 | 68 |
| CAA | Qwen3-8B | 2026.04 | 67 |
| Prompt | Qwen3-4B | 2026.04 | 66 |
| Base | Qwen3-4B | 2026.04 | 65 |
| CAA | Qwen3-4B | 2026.04 | 63 |