Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Web Search on HotpotQA (Latency and Cost)
Loading...
88.5
Performance Score
MasRouter
81.116
83.033
84.95
86.867
Jan 6, 2026
Performance Score
Cost ($)
Delay (h)
Updated 4d ago
Evaluation Results
Method
Method
Links
Performance Score
Cost ($)
Delay (h)
MasRouter
Setting=Routing
2026.01
88.5
59.5
53.1
EvoRoute
Setting=Ours
2026.01
87.8
49.1
60.5
Gemini-2.5-pro
Setting=Manual
2026.01
87.4
343.43
79.18
GPT-4.1
Setting=Manual
2026.01
87
213.22
78.77
GraphRouter
Setting=Routing
2026.01
86.1
64.8
61.8
PromptLLM
Setting=Routing
2026.01
85.2
47.02
62.46
GPT-4o
Setting=Manual
2026.01
83.84
391.98
67.29
Qwen3-14b
Setting=Manual
2026.01
81.4
22.96
35.52
Feedback
Search any
task
Search any
task