Multi-hop Question Answering on 2WikiMultihopQA blind (test)
[Chart: scores over time. Highlighted entry: Beam Retrieval, Answer EM 0.8847, Aug 17, 2023. Selectable metrics: Answer EM, Answer F1, Supporting EM, Supporting F1.]
Updated 4d ago
Evaluation Results
Method                        Date     Answer EM  Answer F1  Supporting EM  Supporting F1
Beam Retrieval (beam size=1)  2023.08  0.8847     0.9087     0.9587         0.9815
NA-Reviewer                   2023.08  0.7673     0.8191     0.8961         0.9431
BigBird-base model            2023.08  0.7405     0.7968     0.7714         0.9213
CRERC                         2023.08  0.6958     0.7233     0.8286         0.9068