Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Single-hop Question Answering benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Single-hop Question Answering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
PopQA
HierSearch
EM
61.6
104
29d ago
TriviaQA
Dep-Search
EM
72
81
29d ago
LoCoMo
ShardMemo
F1
0.6408
53
1mo ago
NQ
Tree-GRPO
Exact Match (EM)
51.7
44
1mo ago
TriviaQA (test)
InfoReasoner-3B
Accuracy
63.4
38
1mo ago
Single-hop QA Average
MemPO
F1 Score
59.61
35
1mo ago
Natural Questions (NQ) (test)
SE-Search-3B
EM
47.5
33
1mo ago
Trivia
Tree-GRPO
Exact Match
68.1
30
1mo ago
LoCoMo Single-Hop (test)
Reproduced Baselines
F1
37.9
24
1mo ago
MFQA en 16k
DCS
Overall Score
23.76
22
1mo ago
Qasper
Llama-3-8B-Instruct
Score
44.79
22
1mo ago
NarrativeQA
DCS
Score
23.89
22
1mo ago
MFQA en
Llama-3-8B-Instruct with DCS
Score
45.83
22
1mo ago
PopQA (test)
InfoReasoner-3B
Accuracy
44.2
21
1mo ago
NQ (Natural Questions) (test)
InfoReasoner-3B
Accuracy
45.3
21
1mo ago
SQuAD
IN-CONTEXT
F1 Score
86.8
21
1mo ago
NQ (Natural Questions) in-domain (test)
ARPO
EM
31.45
20
1mo ago
Factrecall en
LM-infinite
Score
31.36
17
1mo ago
Loogle SD
DCS
Score
45.1
17
1mo ago
Natural Questions
Amber
Accuracy
47.8
15
1mo ago
PopQA 2018 Wikipedia dump (dev)
MR-Search
Accuracy
47.2
14
1mo ago
TriviaQA 2018 Wikipedia dump (dev)
MR-Search
Accuracy
66.6
14
1mo ago
NQ (Natural Questions) 2018 Wikipedia dump (dev)
MR-Search
Accuracy
50.2
14
1mo ago
Single-Hop QA NQ, TriviaQA, PopQA
SLEA-RL
NQ Score
48.5
13
29d ago
Complex-TR ODQA 1.0 (test)
FiD-PIT + Refine
Set Accuracy
49
13
1mo ago
Showing 25 of 36 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs