Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Tool Retrieval and Function Selection on Taskbench-MM
Loading...
27
Function Selection Accuracy
Dynamic LR (DTDR-L)
1.416
8.058
14.7
21.342
Dec 18, 2025
Function Selection Accuracy
MRR
F1 Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Function Selection Accuracy
MRR
F1 Score
Dynamic LR (DTDR-L)
LLM=Qwen 3 0.6B, ICL M...
2025.12
27
0.69
0.55
Dynamic DR (DTDR-C)
LLM=Qwen 3 0.6B, ICL M...
2025.12
24.4
0.52
0.43
LR
LLM=Qwen 3 0.6B, ICL M...
2025.12
21
0.49
0.28
BM-25
LLM=Qwen 3 0.6B, ICL M...
2025.12
13.7
0.26
0.19
DR
LLM=Qwen 3 0.6B, ICL M...
2025.12
13.3
0.38
0.27
QTS (Paramanayakam et al. 2025)
LLM=Qwen 3 0.6B, ICL M...
2025.12
7.6
0.29
0.18
QTS (Vanilla)
LLM=Qwen 3 0.6B, ICL M...
2025.12
5.3
0.14
0.04
QTS (Gao et al. 2025)
LLM=Qwen 3 0.6B, ICL M...
2025.12
5.3
0.14
0.2
Random Guess
LLM=Qwen 3 0.6B, ICL M...
2025.12
2.4
0.11
0.03
Feedback
Search any
task
Search any
task