Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool Retrieval and Function Selection on Taskbench DL
Loading...
58.9
Function Selection Accuracy
Dynamic LR (DTDR-L)
0.14
15.395
30.65
45.905
Dec 18, 2025
Function Selection Accuracy
MRR
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Function Selection Accuracy
MRR
F1 Score
Dynamic LR (DTDR-L)
LLM=Qwen 3 0.6B, ICL M...
2025.12
58.9
0.85
0.64
Dynamic DR (DTDR-C)
LLM=Qwen 3 0.6B, ICL M...
2025.12
28.2
0.54
0.29
LR
LLM=Qwen 3 0.6B, ICL M...
2025.12
23.6
0.55
0.44
QTS (Paramanayakam et al. 2025)
LLM=Qwen 3 0.6B, ICL M...
2025.12
18.9
0.4
0.26
DR
LLM=Qwen 3 0.6B, ICL M...
2025.12
14.7
0.36
0.18
BM-25
LLM=Qwen 3 0.6B, ICL M...
2025.12
13.5
0.31
0.15
QTS (Gao et al. 2025)
LLM=Qwen 3 0.6B, ICL M...
2025.12
13.3
0.17
0.06
QTS (Vanilla)
LLM=Qwen 3 0.6B, ICL M...
2025.12
7.5
0.16
0.07
Random Guess
LLM=Qwen 3 0.6B, ICL M...
2025.12
2.4
0.15
0.07
Feedback
Search any
task
Search any
task