Convolutional Neural Networks for Sentence Classification
About
We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally propose a simple modification to the architecture to allow for the use of both task-specific and static vectors. The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification.
Yoon Kim• 2014
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy83.2 | 681 | |
| Document Classification | RVL-CDIP (test) | Accuracy80.5 | 306 | |
| Subjectivity Classification | Subj | Accuracy93.4 | 266 | |
| Text Classification | AG News (test) | -- | 210 | |
| Question Classification | TREC | Accuracy97.32 | 205 | |
| Text Classification | SST-2 (test) | Accuracy87.2 | 185 | |
| Text Classification | TREC | Accuracy93.6 | 179 | |
| Sentiment Classification | SST-2 | Accuracy88.1 | 174 | |
| Natural Language Inference | SNLI | Accuracy82.1 | 174 | |
| Sentiment Analysis | SST-5 (test) | Accuracy48 | 173 |
Showing 10 of 150 rows
...