Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Function Calling on BFCL v3 2025-08-26 (test)

50Multi-Turn Overall Accuracy

GPT-4o-2024-11-20

7.6218.622529.62540.6275Aug 18, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.08
506145.535.55886.8178.8583.3381.3171.71
2025.08
40.2557.531.5343884.9471.5277.7872.8365.41
2025.08
34.6239.529.531.53865.3574.5933.3390.6759.94
2025.08
32.54825.525.53179.7175.5283.3380.6563.01
2025.08
31.3846.519312980.2978.0572.2290.1164.17
2025.08
29.8741212334.588.5477.3483.3376.4964.71
2025.08
20.88391210.52275.9261.5772.2246.2552.1
2025.08
12.5171310.59.589.9862.2410054.7853.57
2025.08
9.2512107884.2161.0877.7848.8249.57