Convolutional Neural Networks for Sentence Classification
About
We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally propose a simple modification to the architecture to allow for the use of both task-specific and static vectors. The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification.
Yoon Kim• 2014
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy83.2 | 690 | |
| Subjectivity Classification | Subj | Accuracy93.4 | 329 | |
| Document Classification | RVL-CDIP (test) | Accuracy80.5 | 306 | |
| Question Classification | TREC | Accuracy97.32 | 259 | |
| Text Classification | AG News (test) | Accuracy90.3 | 228 | |
| Text Classification | TREC | Accuracy93.6 | 207 | |
| Text Classification | SST-2 (test) | Accuracy87.2 | 185 | |
| Sentiment Classification | SST-2 | Accuracy88.1 | 184 | |
| Natural Language Inference | SNLI | Accuracy82.1 | 180 | |
| Sentiment Analysis | SST-5 (test) | Accuracy48 | 173 |
Showing 10 of 176 rows
...