Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Function Calling on Tool-Alpaca

77.66F1 Score

GPT-4o

41.509650.894860.2869.6652Oct 16, 2025Nov 13, 2025Dec 12, 2025Jan 10, 2026Feb 8, 2026Mar 9, 2026Apr 7, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2025.10
77.6688.6466.67
2025.10
76.7587.3166.18
2025.10
74.3983.2165.56
2025.10
73.4785.8261.11
2025.10
73.3681.4265.3
2025.10
73.378.8667.73
2025.10
72.9382.6863.18
2025.10
72.7780.9364.6
2025.10
72.1579.0365.26
2025.10
71.9680.7863.13
2025.10
71.8583.760
2025.10
71.5780.3162.83
2025.10
71.0379.6962.37
2025.10
69.5477.4261.66
2025.10
69.377.4261.17
2025.10
68.8977.160.67
2025.10
68.8178.2959.32
2025.10
68.7377.2960.16
2025.10
68.2572.8863.61
2026.04
67.75--
2025.10
67.6477.2758
2025.10
67.147658.27
66.67--
2026.04
65.97--
2025.10
65.3875.6455.12
2025.10
64.8673.0356.68
2026.04
64.48--
2025.10
63.1167.2658.96
2026.04
62.5--
2026.04
62.33--
2025.10
62.172.9351.26
2026.04
61.58--
2025.10
61.2270.851.63
2025.10
59.5264.3454.69
2026.04
58.96--
2025.10
57.7264.8650.58
2026.04
53.48--
2025.10
52.6562.0743.23
2026.04
50.58--
2026.04
42.9--