Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Function Calling on BFCL unseen (test)
Loading...
48.5
Accuracy
RimRule
44.86
45.805
46.75
47.695
Dec 31, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
RimRule
Adaptation Method=Recu...
2025.12
48.5
Few-shot
Adaptation Method=Few-...
2025.12
46.6
SEE
Adaptation Method=Self...
2025.12
45.5
Zero-shot
Adaptation Method=Zero...
2025.12
45
Feedback
Search any
task
Search any
task