Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Single-hop Question Answering on NQ
Loading...
43
EM
HELP
21.992
27.446
32.9
38.354
Jan 9, 2025
Mar 18, 2025
May 26, 2025
Aug 2, 2025
Oct 10, 2025
Dec 17, 2025
Feb 24, 2026
EM
F1
Updated 4d ago
Evaluation Results
Method
Method
Links
EM
F1
HELP
Backbone=Qwen3-30B-A3B...
2026.02
43
56.9
HippoRAG2
Backbone=Qwen3-30B-A3B...
2026.02
40.4
54.4
Llama3.3-70B
Reasoning Protocol=Dir...
2025.01
36
48.7
Search-o1
Reasoning Protocol=Ret...
2025.01
34
49.7
RAgent-QwQ-32B
Reasoning Protocol=Ret...
2025.01
33.8
48.4
RAG-Qwen2.5-32B
Reasoning Protocol=Ret...
2025.01
33.4
49.3
RAgent-Qwen2.5-32B
Reasoning Protocol=Ret...
2025.01
32.4
47.8
LinearRAG
Backbone=Qwen3-30B-A3B...
2026.02
31.6
43.8
HyperGraphRAG
Backbone=Qwen3-30B-A3B...
2026.02
30.3
42.1
RAG-QwQ-32B
Reasoning Protocol=Ret...
2025.01
29.6
44.4
Qwen2.5-72B
Reasoning Protocol=Dir...
2025.01
27.6
41.2
QwQ-32B
Reasoning Protocol=Dir...
2025.01
23
33.1
Qwen2.5-32B
Reasoning Protocol=Dir...
2025.01
22.8
33.9
Feedback
Search any
task
Search any
task