Task-Efficient Routing on Security Curated Task Benchmark 1.0 (test)
[Chart: Avg. Cost, Reduction, and Avg. Score by method. Data point shown: Force Weak, Avg. Cost 0.0021, Jan 27, 2026.]
Updated 4d ago
Evaluation Results
Method        Date     Avg. Cost  Reduction (%)  Avg. Score
Force Weak    2026.01  0.0021     67.2           83.5
CASTER        2026.01  0.0049     23.4           86.2
Force Strong  2026.01  0.0064     -              85.5
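
The Reduction figures appear to be the percentage of average cost saved relative to Force Strong. The page does not state this, so treat the baseline choice and the formula as assumptions; they do reproduce the listed values. A minimal sketch of that arithmetic, using only the Avg. Cost numbers from the results (the function and variable names are illustrative):

```python
# Avg. Cost values copied from the evaluation results.
avg_cost = {"Force Weak": 0.0021, "CASTER": 0.0049, "Force Strong": 0.0064}


def cost_reduction_pct(cost: float, baseline: float) -> float:
    """Percentage of cost saved relative to the baseline, to one decimal."""
    return round((1 - cost / baseline) * 100, 1)


# Assumption: Force Strong is the reference method (its Reduction is "-").
baseline = avg_cost["Force Strong"]

reductions = {
    name: cost_reduction_pct(cost, baseline)
    for name, cost in avg_cost.items()
    if name != "Force Strong"
}

print(reductions)  # {'Force Weak': 67.2, 'CASTER': 23.4}
```

Under this reading, CASTER gives up most of the cost saving of Force Weak (23.4 vs. 67.2) while posting the highest Avg. Score of the three methods (86.2).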