Hierarchical Graph Network for Multi-hop Question Answering
About
In this paper, we present Hierarchical Graph Network (HGN) for multi-hop question answering. To aggregate clues from scattered texts across multiple paragraphs, a hierarchical graph is created by constructing nodes on different levels of granularity (questions, paragraphs, sentences, entities), the representations of which are initialized with pre-trained contextual encoders. Given this hierarchical graph, the initial node representations are updated through graph propagation, and multi-hop reasoning is performed via traversing through the graph edges for each subsequent sub-task (e.g., paragraph selection, supporting facts extraction, answer prediction). By weaving heterogeneous nodes into an integral unified graph, this hierarchical differentiation of node granularity enables HGN to support different question answering sub-tasks simultaneously. Experiments on the HotpotQA benchmark demonstrate that the proposed model achieves new state of the art, outperforming existing multi-hop QA approaches.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Multi-hop Question Answering | HotpotQA fullwiki setting (test) | Answer F171.41 | 64 | |
| Answer extraction and supporting sentence prediction | HotpotQA fullwiki (test) | Answer EM59.74 | 48 | |
| Question Answering | HotpotQA distractor (dev) | Answer F183.4 | 45 | |
| Question Answering | HotpotQA (test) | Ans F182.2 | 37 | |
| Question Answering | HotpotQA distractor setting (test) | Answer F182.2 | 34 | |
| Question Answering | HotpotQA (test) | EM0.692 | 12 | |
| Multi-hop Text QA | HotpotQA Full v1.1 (train) | F1 Score82.2 | 9 | |
| Conversational Question Answering | ShARC Long (dev) | Easy Accuracy61.1 | 7 | |
| Multi-hop Question Answering | HotpotQA blind (test) | Answer EM69.22 | 6 | |
| Question Answering | HotpotQA May 2020 (leaderboard) | F1 Score74.2 | 2 |