Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Answering Complex Open-domain Questions Through Iterative Query Generation

About

It is challenging for current one-step retrieve-and-read question answering (QA) systems to answer questions like "Which novel by the author of 'Armada' will be adapted as a feature film by Steven Spielberg?" because the question seldom contains retrievable clues about the missing entity (here, the author). Answering such a question requires multi-hop reasoning where one must gather information about the missing entity (or facts) to proceed with further reasoning. We present GoldEn (Gold Entity) Retriever, which iterates between reading context and retrieving more supporting documents to answer open-domain multi-hop questions. Instead of using opaque and computationally expensive neural retrieval models, GoldEn Retriever generates natural language search queries given the question and available context, and leverages off-the-shelf information retrieval systems to query for missing entities. This allows GoldEn Retriever to scale up efficiently for open-domain multi-hop reasoning while maintaining interpretability. We evaluate GoldEn Retriever on the recently proposed open-domain multi-hop QA dataset, HotpotQA, and demonstrate that it outperforms the best previously published model despite not using pretrained language models such as BERT.

Peng Qi, Xiaowen Lin, Leo Mehr, Zijian Wang, Christopher D. Manning• 2019

Related benchmarks

TaskDatasetResultRank
Multi-hop Question AnsweringHotpotQA fullwiki setting (test)
Answer F148.6
64
Answer extraction and supporting sentence predictionHotpotQA fullwiki (test)
Answer EM37.9
48
Multi-hop Question AnsweringHotpotQA fullwiki setting (dev)
Answer F149.8
38
Question AnsweringHotpotQA (test)
Ans F149.8
37
Question AnsweringHotpotQA full wiki (dev)
F149.8
20
RetrievalHotpotQA full wiki (dev)
PEM47.77
19
Supporting Fact PredictionHotpotQA full wiki (dev)
F1 Score64.6
19
Question AnsweringHotpotQA Full Wiki hidden (test)
F148.6
12
Supporting Facts PredictionHotpotQA Full Wiki hidden (test)
F1 Score64.2
11
Showing 9 of 9 rows

Other info

Code

Follow for update