Bilateral Multi-Perspective Matching for Natural Language Sentences
About
Natural language sentence matching is a fundamental technology for a variety of tasks. Previous approaches either match sentences from a single direction or only apply single granular (word-by-word or sentence-by-sentence) matching. In this work, we propose a bilateral multi-perspective matching (BiMPM) model under the "matching-aggregation" framework. Given two sentences $P$ and $Q$, our model first encodes them with a BiLSTM encoder. Next, we match the two encoded sentences in two directions $P \rightarrow Q$ and $P \leftarrow Q$. In each matching direction, each time step of one sentence is matched against all time-steps of the other sentence from multiple perspectives. Then, another BiLSTM layer is utilized to aggregate the matching results into a fix-length matching vector. Finally, based on the matching vector, the decision is made through a fully connected layer. We evaluate our model on three tasks: paraphrase identification, natural language inference and answer sentence selection. Experimental results on standard benchmark datasets show that our model achieves the state-of-the-art performance on all tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy88.8 | 681 | |
| Natural Language Inference | SNLI (train) | Accuracy93.2 | 154 | |
| Answer Selection | WikiQA (test) | MAP0.718 | 149 | |
| Multi-turn Response Selection | Ubuntu Dialogue Corpus V1 (test) | R10@166.5 | 102 | |
| Paraphrase Identification | Quora Question Pairs (test) | Accuracy88.2 | 72 | |
| Response Ranking | Ubuntu Dialog Corpus v1 (test) | Recall@1 (1/2)89.7 | 16 | |
| Answer Sentence Selection | TREC-QA clean-version | MAP80.2 | 14 | |
| Paraphrase Identification | Quora Question Pairs (dev) | Accuracy88.69 | 14 | |
| Multi-turn Response Selection | Ubuntu Dialogue Corpus V1 | Recall@1 (Pool 10)66.5 | 14 | |
| Paraphrase Identification | Quora Question Pairs | Accuracy83.2 | 14 |