Sentence Embeddings in NLI with Iterative Refinement Encoders
About
Sentence-level representations are necessary for various NLP tasks. Recurrent neural networks have proven to be very effective in learning distributed representations and can be trained efficiently on natural language inference tasks. We build on top of one such model and propose a hierarchy of BiLSTM and max pooling layers that implements an iterative refinement strategy and yields state of the art results on the SciTail dataset as well as strong results for SNLI and MultiNLI. We can show that the sentence embeddings learned in this way can be utilized in a wide variety of transfer learning tasks, outperforming InferSent on 7 out of 10 and SkipThought on 8 out of 9 SentEval sentence embedding evaluation tasks. Furthermore, our model beats the InferSent model in 8 out of 10 recently published SentEval probing tasks designed to evaluate sentence embeddings' ability to capture some of the important linguistic properties of sentences.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy86.6 | 681 | |
| Sentiment Classification | MR (test) | Accuracy81.7 | 142 | |
| Subjectivity Classification | Subj (test) | Accuracy93.7 | 125 | |
| Natural Language Inference | SciTail (test) | Accuracy86 | 86 | |
| Natural Language Inference | MultiNLI matched (test) | Accuracy73.7 | 65 | |
| Semantic Relatedness | SICK 2014 (test) | Pearson's r0.876 | 56 | |
| Natural Language Inference | MultiNLI mismatched (test) | Accuracy73 | 56 | |
| Semantic Textual Similarity | STS 2014 | Spearman Correlation0.68 | 35 | |
| Textual Entailment | SICK (test) | Accuracy85.3 | 21 | |
| Paraphrase Detection | Microsoft Paraphrase Corpus (MRPC) (test) | Accuracy76.7 | 15 |