Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Tool Use on API Bank
Loading...
90
Accuracy
Llama 3.1 Instruct
10.232
30.941
51.65
72.359
Apr 23, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Llama 3.1 Instruct
Model Scale=70B
2025.04
90
Llama 3 Instruct
Model Scale=70B
2025.04
85.2
ParamΔ
Model Scale=70B
2025.04
82.9
Llama 3.1 Instruct
Model Scale=8B
2025.04
82.1
ParamΔ
Model Scale=8B
2025.04
51.9
Llama 3 Instruct
Model Scale=8B
2025.04
48.9
Llama 3 Base
Model Scale=70B
2025.04
37.9
Llama 3 Base
Model Scale=8B
2025.04
25.3
Llama 3.1 Base
Model Scale=8B
2025.04
24.8
Llama 3.1 Base
Model Scale=70B
2025.04
13.3
Feedback
Search any
task
Search any
task