
Learning to Generate Reviews and Discovering Sentiment

About

We explore the properties of byte-level recurrent language models. When given sufficient amounts of capacity, training data, and compute time, the representations learned by these models include disentangled features corresponding to high-level concepts. Specifically, we find a single unit which performs sentiment analysis. These representations, learned in an unsupervised manner, achieve state of the art on the binary subset of the Stanford Sentiment Treebank. They are also very data efficient. When using only a handful of labeled examples, our approach matches the performance of strong baselines trained on full datasets. We also demonstrate the sentiment unit has a direct influence on the generative process of the model. Simply fixing its value to be positive or negative generates samples with the corresponding positive or negative sentiment.
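The core finding is that one coordinate of the model's hidden state tracks sentiment. A minimal NumPy sketch of how such a unit can be identified — here on synthetic hidden states with one planted sentiment-correlated coordinate, ranking units by how well each one alone separates the two classes (the paper itself selects the unit via an L1-regularized logistic regression probe, which this simpler scoring stands in for):

```python
import numpy as np

# Hypothetical setup: "hidden states" from a byte-level RNN are simulated
# as random features, with one planted unit correlated with sentiment.
rng = np.random.default_rng(0)
n, d, planted = 200, 64, 17            # examples, hidden size, planted unit index
labels = rng.integers(0, 2, size=n)    # 1 = positive, 0 = negative
states = rng.normal(size=(n, d))
states[:, planted] += 3.0 * (2 * labels - 1)  # shift the planted unit by class

def sentiment_unit(states, labels):
    """Index of the single unit whose activation best separates the
    classes (absolute difference of class means over pooled std)."""
    pos, neg = states[labels == 1], states[labels == 0]
    score = np.abs(pos.mean(axis=0) - neg.mean(axis=0)) / (states.std(axis=0) + 1e-8)
    return int(np.argmax(score))

print(sentiment_unit(states, labels))  # recovers the planted unit, 17
```

On real model activations the same idea applies: a single coordinate with outsized class-separating power is the "sentiment unit", and clamping it to a positive or negative value during sampling steers the sentiment of generated text.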

Alec Radford, Rafal Jozefowicz, Ilya Sutskever · 2017

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Subjectivity Classification | Subj | Accuracy | 94.7 | 266 |
| Sentiment Analysis | IMDB (test) | Accuracy | 92.9 | 248 |
| Text Classification | SST-2 (test) | Accuracy | 91.8 | 185 |
| Text Classification | TREC | Accuracy | 90.4 | 179 |
| Opinion Polarity Detection | MPQA | Accuracy | 88.5 | 154 |
| Sentiment Classification | IMDB (test) | Error Rate | 7.12 | 144 |
| Sentiment Classification | CR | Accuracy | 91.4 | 142 |
| Text Classification | IMDB | Accuracy | 92.2 | 107 |
| Sentiment Classification | Stanford Sentiment Treebank SST-2 (test) | Accuracy | 91.8 | 99 |
| Text Classification | MR (test) | Accuracy | 86.9 | 99 |

Showing 10 of 18 rows
