Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval
About
We propose a simple and efficient multi-hop dense retrieval approach for answering complex open-domain questions, which achieves state-of-the-art performance on two multi-hop datasets, HotpotQA and multi-evidence FEVER. Contrary to previous work, our method does not require access to any corpus-specific information, such as inter-document hyperlinks or human-annotated entity markers, and can be applied to any unstructured text corpus. Our system also yields a much better efficiency-accuracy trade-off, matching the best published accuracy on HotpotQA while being 10 times faster at inference time.
Wenhan Xiong, Xiang Lorraine Li, Srini Iyer, Jingfei Du, Patrick Lewis, William Yang Wang, Yashar Mehdad, Wen-tau Yih, Sebastian Riedel, Douwe Kiela, Barlas O\u{g}uz• 2020
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-hop Question Answering | HotpotQA (test) | -- | 198 | |
| Multi-hop Question Answering | HotpotQA fullwiki setting (test) | Answer F175.3 | 64 | |
| Answer extraction and supporting sentence prediction | HotpotQA fullwiki (test) | Answer EM62.3 | 48 | |
| Question Answering | HotpotQA (dev) | Answer F175.1 | 43 | |
| Multi-hop Question Answering | HotpotQA fullwiki setting (dev) | Answer F175.6 | 38 | |
| Question Answering | HotpotQA (test) | Ans F175.3 | 37 | |
| Page-level retrieval | KILT (test) | WoW Score36.5 | 28 | |
| Retrieval | HotpotQA | -- | 24 | |
| Question Answering | HotpotQA full wiki (dev) | F175.1 | 20 | |
| Supporting Fact Prediction | HotpotQA full wiki (dev) | F1 Score79.4 | 19 |
Showing 10 of 20 rows