Stochastic Answer Networks for Machine Reading Comprehension
About
We propose a simple yet robust stochastic answer network (SAN) that simulates multi-step reasoning in machine reading comprehension. Compared to previous work such as ReasoNet which used reinforcement learning to determine the number of steps, the unique feature is the use of a kind of stochastic prediction dropout on the answer module (final layer) of the neural network during the training. We show that this simple trick improves robustness and achieves results competitive to the state-of-the-art on the Stanford Question Answering Dataset (SQuAD), the Adversarial SQuAD, and the Microsoft MAchine Reading COmprehension Dataset (MS MARCO).
Xiaodong Liu, Yelong Shen, Kevin Duh, Jianfeng Gao• 2017
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Question Answering | SQuAD v1.1 (dev) | F1 Score85.866 | 375 | |
| Question Answering | SQuAD v1.1 (test) | F1 Score86.496 | 260 | |
| Question Answering | SQuAD (test) | F184.4 | 111 | |
| Machine Reading Comprehension | SQuAD 2.0 (test) | EM68.6 | 51 | |
| Machine Reading Comprehension | SQuAD 1.1 (dev) | EM78.6 | 48 | |
| Machine Reading Comprehension | SQuAD 1.1 (test) | EM79.6 | 46 | |
| Question Answering | SQuAD 2.0 Sep 9, 2018 (test) | EM71.3 | 17 | |
| Reading Comprehension | Adversarial SQuAD AddOneSent v1.1 (test) | F1 Score56.5 | 10 | |
| Reading Comprehension | Adversarial SQuAD AddSent v1.1 (test) | F146.6 | 10 | |
| Machine Reading Comprehension | AddSent (adversarial) | F1 Score46.6 | 6 |
Showing 10 of 12 rows