Dynamic Self-Attention: Computing Attention over Words Dynamically for Sentence Embedding
About
In this paper, we propose Dynamic Self-Attention (DSA), a new self-attention mechanism for sentence embedding. We design DSA by adapting the dynamic routing mechanism of capsule networks (Sabour et al., 2017) to natural language processing. DSA attends to informative words with a dynamic weight vector. We achieve new state-of-the-art results among sentence encoding methods on the Stanford Natural Language Inference (SNLI) dataset with the fewest parameters, while showing comparable results on the Stanford Sentiment Treebank (SST) dataset.
Deunsol Yoon, Dongbok Lee, SangKeun Lee • 2018
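
For intuition, here is a minimal sketch of what routing-style attention over word vectors can look like. It is written in PyTorch, which the paper does not prescribe; the function name, zero initialization of the query, iteration count, and squashing details are illustrative assumptions rather than the authors' exact formulation.

```python
import torch
import torch.nn.functional as F

def dynamic_self_attention(h, num_iters=3):
    """Routing-style attention over word vectors (illustrative sketch).

    h: (batch, seq_len, dim) tensor of word representations.
    Returns a (batch, dim) sentence embedding.
    """
    batch, seq_len, dim = h.shape
    # Dynamic weight (query) vector, initialized to zero so the first
    # iteration attends uniformly over all words.
    q = h.new_zeros(batch, dim)
    v = h.new_zeros(batch, dim)
    for _ in range(num_iters):
        scores = torch.einsum('bd,bsd->bs', q, h)   # word-query alignment
        alpha = F.softmax(scores, dim=-1)           # attention over words
        s = torch.einsum('bs,bsd->bd', alpha, h)    # weighted sum of words
        # Squashing nonlinearity from capsule routing (Sabour et al., 2017).
        norm = s.norm(dim=-1, keepdim=True)
        v = (norm ** 2 / (1.0 + norm ** 2)) * (s / (norm + 1e-8))
        # Routing-by-agreement analogue: move the query toward words that
        # agree with the current sentence vector, sharpening the attention.
        q = q + v
    return v
```

As a usage example, `dynamic_self_attention(torch.randn(2, 16, 300))` returns a `(2, 300)` sentence embedding. Because the query starts at zero, the first pass is a plain average of word vectors; subsequent passes accumulate agreement, so attention concentrates on words consistent with the emerging sentence representation.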
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy | 87.4 | 681 |
| Sentiment Analysis | SST-5 (test) | Accuracy | 50.6 | 173 |
| Sentiment Analysis | SST-2 (test) | Accuracy | 88.5 | 136 |
| Natural Language Inference | SNLI 1.0 (test) | Accuracy | 87.4 | 19 |
| Sentiment Analysis | Stanford Sentiment Treebank (SST) (test) | Accuracy | 50.6 | 10 |
| Natural Language Inference | SNLI 1.0 (train) | Accuracy | 89 | 9 |