Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Single-Hop Question Answering on Natural Questions (NQ)
Loading...
64
Avg@4
OWL-8B
47.88
52.065
56.25
60.435
Feb 4, 2026
Avg@4
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg@4
OWL-8B
Setting=Multi-Agent Sy...
2026.02
64
SingleSeek-R1-4B
Setting=Single Agent
2026.02
58.8
AgentFlow-7B
Setting=Multi-Agent Sy...
2026.02
58.5
WIDESEEK-R1-4B
Setting=Multi-Agent Sy...
2026.02
56.1
ASearcher-7B
Setting=Single Agent
2026.02
54.5
MiroFlow-8B
Setting=Multi-Agent Sy...
2026.02
50.9
Search-R1-7B
Setting=Single Agent
2026.02
49.9
Qwen3-4B
Setting=Multi-Agent Sy...
2026.02
49.6
Qwen3-4B
Setting=Single Agent
2026.02
48.5
Feedback
Search any
task
Search any
task