Our new X account is live! Follow @wizwand_team for updates
Search any
task
Feedback
Search any
task
SOTA Single-hop Question Answering benchmarks and papers with code | Wizwand
Our new X account is live! Follow @wizwand_team for updates
Home
/
Tasks
Single-hop Question Answering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
TriviaQA
Dep-Search
EM
72
62
3d ago
PopQA
HierSearch
EM
61.6
55
3d ago
LoCoMo
ShardMemo
F1
0.6408
53
3d ago
LoCoMo Single-Hop (test)
Reproduced Baselines
F1
37.9
24
3d ago
MFQA en 16k
DCS
Overall Score
23.76
22
3d ago
Qasper
Llama-3-8B-Instruct
Score
44.79
22
3d ago
NarrativeQA
DCS
Score
23.89
22
3d ago
MFQA en
Llama-3-8B-Instruct with DCS
Score
45.83
22
3d ago
PopQA (test)
InfoReasoner-3B
Accuracy
44.2
21
3d ago
TriviaQA (test)
InfoReasoner-3B
Accuracy
63.4
21
3d ago
NQ (Natural Questions) (test)
InfoReasoner-3B
Accuracy
45.3
21
3d ago
SQuAD
IN-CONTEXT
F1 Score
86.8
21
3d ago
NQ (Natural Questions) in-domain (test)
ARPO
EM
31.45
20
3d ago
Factrecall en
LM-infinite
Score
31.36
17
3d ago
Loogle SD
DCS
Score
45.1
17
3d ago
Natural Questions (NQ) (test)
ProGraph-R1
EM
35.94
16
3d ago
Natural Questions
Amber
Accuracy
47.8
15
3d ago
Complex-TR ODQA 1.0 (test)
FiD-PIT + Refine
Set Accuracy
49
13
3d ago
PopQA out-of-domain
ZeroSearch
Accuracy
51.5
9
2d ago
TriviaQA (out-of-domain)
EvolveR
Accuracy
63.4
9
2d ago
NQ (in-domain)
SKILLRL
Accuracy
45.9
9
2d ago
Natural Questions (NQ)
OWL-8B
Avg@4
64
9
3d ago
NQ 2019 (test)
CIRAG
F1 Score
61.4
8
3d ago
MS MARCO V2
SHINE
Answer F1 Score
40.8
6
3d ago
MS MARCO V1
SHINE
F1 Score (Answer)
40.7
6
3d ago
Showing 25 of 29 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Terms of Service
FAQs