A Convolutional Neural Network for Modelling Sentences
About
The ability to accurately represent sentences is central to language understanding. We describe a convolutional architecture dubbed the Dynamic Convolutional Neural Network (DCNN) that we adopt for the semantic modelling of sentences. The network uses Dynamic k-Max Pooling, a global pooling operation over linear sequences. The network handles input sentences of varying length and induces a feature graph over the sentence that is capable of explicitly capturing short and long-range relations. The network does not rely on a parse tree and is easily applicable to any language. We test the DCNN in four experiments: small scale binary and multi-class sentiment prediction, six-way question classification and Twitter sentiment prediction by distant supervision. The network achieves excellent performance in the first three tasks and a greater than 25% error reduction in the last task with respect to the strongest baseline.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Subjectivity Classification | Subj | Accuracy93 | 266 | |
| Question Classification | TREC | Accuracy93 | 205 | |
| Sentiment Classification | SST-2 | Accuracy86.8 | 174 | |
| Sentiment Analysis | SST-5 (test) | Accuracy48.5 | 173 | |
| Question Classification | TREC (test) | Accuracy93 | 124 | |
| Text Classification | SST-2 | Accuracy86.8 | 121 | |
| Sentiment Classification | Stanford Sentiment Treebank SST-2 (test) | Accuracy86.8 | 99 | |
| Text Classification | SST-1 | Accuracy48.5 | 45 | |
| 6-way question classification | TREC 6-class (test) | Accuracy93 | 23 | |
| Sentiment Analysis | SST-1 (test) | Accuracy48.5 | 10 |