Multi-hop Question Answering on 2WikiMultihopQA blind (test)
[Figure: leaderboard chart of Answer EM; highlighted point: Beam Retrieval, 0.8847, Aug 17, 2023]
Updated 1mo ago
Evaluation Results
Method                        Date     Answer EM  Answer F1  Supporting EM  Supporting F1
Beam Retrieval (beam size=1)  2023.08  0.8847     0.9087     0.9587         0.9815
NA-Reviewer                   2023.08  0.7673     0.8191     0.8961         0.9431
BigBird-base model            2023.08  0.7405     0.7968     0.7714         0.9213
CRERC                         2023.08  0.6958     0.7233     0.8286         0.9068