Single-Hop Question Answering on PopQA out-of-domain
[Chart: Accuracy over time. Top result: ZeroSearch, 51.5 accuracy. Date shown: Feb 9, 2026. Updated 2d ago.]
Evaluation Results
Method        Configuration               Date      Accuracy
ZeroSearch    Backbone=Qwen2.5-7B-In...   2026.02   51.5
SKILLRL       Backbone=Qwen2.5-7B-In...   2026.02   45.9
EvolveR       Backbone=Qwen2.5-7B-In...   2026.02   44.6
Search-R1     Backbone=Qwen2.5-7B-In...   2026.02   39.7
RAG           Backbone=Qwen2.5-7B-In...   2026.02   17.8
R1-Instruct   Backbone=Qwen2.5-7B-In...   2026.02   17.1
Search-o1     Backbone=Qwen2.5-7B-In...   2026.02   11.4
CoT           Backbone=Qwen2.5-7B-In...   2026.02   3.8
Qwen2.5       Backbone=Qwen2.5-7B-In...   2026.02   1.2