Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

A Convolutional Neural Network for Modelling Sentences

About

The ability to accurately represent sentences is central to language understanding. We describe a convolutional architecture dubbed the Dynamic Convolutional Neural Network (DCNN) that we adopt for the semantic modelling of sentences. The network uses Dynamic k-Max Pooling, a global pooling operation over linear sequences. The network handles input sentences of varying length and induces a feature graph over the sentence that is capable of explicitly capturing short and long-range relations. The network does not rely on a parse tree and is easily applicable to any language. We test the DCNN in four experiments: small scale binary and multi-class sentiment prediction, six-way question classification and Twitter sentiment prediction by distant supervision. The network achieves excellent performance in the first three tasks and a greater than 25% error reduction in the last task with respect to the strongest baseline.

Nal Kalchbrenner, Edward Grefenstette, Phil Blunsom• 2014

Related benchmarks

TaskDatasetResultRank
Subjectivity ClassificationSubj
Accuracy93
343
Question ClassificationTREC
Accuracy93
262
Sentiment ClassificationSST-2
Accuracy86.8
190
Sentiment AnalysisSST-5 (test)
Accuracy48.5
177
Text ClassificationSST-2
Accuracy86.8
136
Question ClassificationTREC (test)
Accuracy93
128
Sentiment ClassificationStanford Sentiment Treebank SST-2 (test)
Accuracy86.8
105
Text ClassificationSST-1
Accuracy48.5
45
6-way question classificationTREC 6-class (test)
Accuracy93
41
Sentiment AnalysisSST-1 (test)
Accuracy48.5
10
Showing 10 of 12 rows

Other info

Follow for update