Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Convolutional Neural Networks for Sentence Classification

About

We report on a series of experiments with convolutional neural networks (CNN) trained on top of pre-trained word vectors for sentence-level classification tasks. We show that a simple CNN with little hyperparameter tuning and static vectors achieves excellent results on multiple benchmarks. Learning task-specific vectors through fine-tuning offers further gains in performance. We additionally propose a simple modification to the architecture to allow for the use of both task-specific and static vectors. The CNN models discussed herein improve upon the state of the art on 4 out of 7 tasks, which include sentiment analysis and question classification.

Yoon Kim• 2014

Related benchmarks

TaskDatasetResultRank
Natural Language InferenceSNLI (test)
Accuracy83.2
690
Subjectivity ClassificationSubj
Accuracy93.4
329
Document ClassificationRVL-CDIP (test)
Accuracy80.5
306
Question ClassificationTREC
Accuracy97.32
259
Text ClassificationAG News (test)
Accuracy90.3
228
Text ClassificationTREC
Accuracy93.6
207
Text ClassificationSST-2 (test)
Accuracy87.2
185
Sentiment ClassificationSST-2
Accuracy88.1
184
Natural Language InferenceSNLI
Accuracy82.1
180
Sentiment AnalysisSST-5 (test)
Accuracy48
173
Showing 10 of 176 rows
...

Other info

Code

Follow for update