Answering Any-hop Open-domain Questions with Iterative Document Reranking
About
Existing approaches for open-domain question answering (QA) are typically designed for questions that require either single-hop or multi-hop reasoning, which make strong assumptions of the complexity of questions to be answered. Also, multi-step document retrieval often incurs higher number of relevant but non-supporting documents, which dampens the downstream noise-sensitive reader module for answer extraction. To address these challenges, we propose a unified QA framework to answer any-hop open-domain questions, which iteratively retrieves, reranks and filters documents, and adaptively determines when to stop the retrieval process. To improve the retrieval accuracy, we propose a graph-based reranking model that perform multi-document interaction as the core of our iterative reranking framework. Our method consistently achieves performance comparable to or better than the state-of-the-art on both single-hop and multi-hop open-domain QA datasets, including Natural Questions Open, SQuAD Open, and HotpotQA.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-hop Question Answering | HotpotQA fullwiki setting (test) | Answer F175.9 | 64 | |
| Answer extraction and supporting sentence prediction | HotpotQA fullwiki (test) | Answer EM62.5 | 48 | |
| Question Answering | HotpotQA (dev) | Answer F176.9 | 43 | |
| Open-domain Question Answering | SQUAD Open (test) | Exact Match56.6 | 39 | |
| Multi-hop Question Answering | HotpotQA fullwiki setting (dev) | Answer F175.9 | 38 | |
| Open-domain Question Answering | NaturalQ-Open (test) | EM45.5 | 37 | |
| Question Answering | HotpotQA (test) | Ans F175.9 | 37 | |
| Question Answering | HotpotQA full wiki (dev) | F176.9 | 20 | |
| Supporting Fact Prediction | HotpotQA full wiki (dev) | F1 Score79.1 | 19 | |
| Retrieval | HotpotQA full wiki (dev) | PEM79.8 | 19 |