Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Single-Hop Question Answering on NQ (in-domain)
Loading...
45.9
Accuracy
SKILLRL
10.228
19.489
28.75
38.011
Feb 9, 2026
Accuracy
Updated 2d ago
Evaluation Results
Method
Method
Links
Accuracy
SKILLRL
Backbone=Qwen2.5-7B-In...
2026.02
45.9
ZeroSearch
Backbone=Qwen2.5-7B-In...
2026.02
43.6
EvolveR
Backbone=Qwen2.5-7B-In...
2026.02
43.5
Search-R1
Backbone=Qwen2.5-7B-In...
2026.02
39.3
RAG
Backbone=Qwen2.5-7B-In...
2026.02
27.4
R1-Instruct
Backbone=Qwen2.5-7B-In...
2026.02
21
Search-o1
Backbone=Qwen2.5-7B-In...
2026.02
19.4
CoT
Backbone=Qwen2.5-7B-In...
2026.02
12.8
Qwen2.5
Backbone=Qwen2.5-7B-In...
2026.02
11.6
Feedback
Search any
task
Search any
task