Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Supporting Fact Prediction on HotpotQA distractor (dev)
Loading...
89
F1 Score
Gated Memory Flow
65.808
71.829
77.85
83.871
Nov 24, 2019
Apr 23, 2020
Sep 21, 2020
Feb 19, 2021
Jul 20, 2021
Dec 18, 2021
May 18, 2022
F1 Score
EM
Updated 4d ago
Evaluation Results
Method
Method
Links
F1 Score
EM
Gated Memory Flow
2022.05
89
64.7
PATHFiD+
cross_passage_interact...
2022.05
88.7
64.9
SAE
model_size=large
2022.05
87.4
63.3
PATHFiD
2022.05
85.7
59.3
SAE
model_size=base
2022.05
85.3
58.1
Recurrent Graph-based Retrieval
Reader=BERT wwm
2019.11
85.2
58.6
Graph Recurrent Retriever
variant=wwm
2022.05
85.2
58.6
QFE
2019.11
84.7
58.8
QFE
2022.05
84.7
58.8
Recurrent Graph-based Retrieval
Reader=BERT base
2019.11
84.6
57.4
Graph Recurrent Retriever
variant=base
2022.05
84.6
57.4
Baseline
2019.11
66.7
22
Baseline
2022.05
66.7
22
Feedback
Search any
task
Search any
task