Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

2WikiMQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-hop Question Answering2WikiMQA
F1 Score76.4
154
Question Answering2WikiMQA
F174.9
44
Multi-hop Reasoning2WikiMQA IRCoT 500 samples (test)
ACC52.8
27
Question Answering2WikiMQA (test)
EM35.9
18
Retrieval2WikiMQA (test)
Recall@K69.7
8
Multi-hop Question Answering2WikiMQA (test)
Exact Match48.6
7
Question Answering2WikiMQA (sampled)
Accuracy0.63
4
Showing 7 of 7 rows