Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-hop Tool-use on ToolHop rand (test)
Loading...
31.1
Accuracy
RimRule
26.316
27.558
28.8
30.042
Dec 31, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
RimRule
Adaptation Method=Recu...
2025.12
31.1
Few-shot
Adaptation Method=Few-...
2025.12
29.9
SEE
Adaptation Method=Self...
2025.12
27.6
Zero-shot
Adaptation Method=Zero...
2025.12
26.5
Feedback
Search any
task
Search any
task