Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Tool-augmented reasoning on BFCL Multi-Turn v3

69.1Overall Score

APIGen-MT

6.90823.05439.255.346Jan 6, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
69.17769.56367
506145.535.558
2026.01
40.357.531.53438
2026.01
3742.538.531.535.5
34.639.529.531.538
2026.01
31.446.5193129
29.941212334.5
2026.01
2742161733
2026.01
26.535.52427.519
2026.01
2531.520.524.523.5
2026.01
22.6----
17.622.511.51917.5
12.5171310.59.5
9.3121078