Agentic/Tool Use on Agentic/tool BFCL and Tool-2
Overall Score: 0.711 (DASD)
[Chart: Overall Score by date; May 21, 2026]
Evaluation Results

| Method | Configuration | Date | Overall Score |
|---|---|---|---|
| DASD | Backbone=Qwen3-8B | 2026.05 | 0.711 |
| OPSD | Backbone=Qwen3-8B | 2026.05 | 0.703 |
| GRPO | Backbone=Qwen3-8B | 2026.05 | 0.701 |
| Qwen3-8B | Post-training=Base | 2026.05 | 0.696 |