Skip-Thought Vectors
About
We describe an approach for unsupervised learning of a generic, distributed sentence encoder. Using the continuity of text from books, we train an encoder-decoder model that tries to reconstruct the surrounding sentences of an encoded passage. Sentences that share semantic and syntactic properties are thus mapped to similar vector representations. We next introduce a simple vocabulary expansion method to encode words that were not seen as part of training, allowing us to expand our vocabulary to a million words. After training our model, we extract and evaluate our vectors with linear models on 8 tasks: semantic relatedness, paraphrase detection, image-sentence ranking, question-type classification and 4 benchmark sentiment and subjectivity datasets. The end result is an off-the-shelf encoder that can produce highly generic sentence representations that are robust and perform well in practice. We will make our encoder publicly available.
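The vocabulary-expansion step described above can be sketched as a simple linear regression: learn a map from a large pretrained word space (such as word2vec) into the encoder's word-embedding space using the words the two vocabularies share, then map unseen words through it. The dimensions, data, and variable names below are synthetic placeholders, not the paper's actual matrices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: 300-d pretrained vectors, 620-d encoder embeddings,
# 1000 words present in both vocabularies.
d_w2v, d_rnn, n_shared = 300, 620, 1000
V_w2v = rng.normal(size=(n_shared, d_w2v))          # pretrained vectors (shared vocab)
W_true = rng.normal(size=(d_w2v, d_rnn)) / np.sqrt(d_w2v)
V_rnn = V_w2v @ W_true                              # encoder embeddings (synthetic)

# Un-regularized least squares: W = argmin_W ||V_w2v W - V_rnn||^2
W, *_ = np.linalg.lstsq(V_w2v, V_rnn, rcond=None)

# Any word with a pretrained vector can now be projected into the encoder's
# embedding space, even if it was never seen during skip-thought training.
unseen_word_vec = rng.normal(size=(d_w2v,))
expanded = unseen_word_vec @ W
print(expanded.shape)  # (620,)
```

With enough shared words relative to the embedding dimensionality, this single linear map lets the trained encoder handle a vocabulary far larger than the one seen during training.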
Related benchmarks
| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| Natural Language Inference | SNLI (test) | Accuracy | 87.7 | 681 |
| Subjectivity Classification | Subj | Accuracy | 94.2 | 266 |
| Question Classification | TREC | Accuracy | 92.2 | 205 |
| Text Classification | TREC | Accuracy | 93 | 179 |
| Opinion Polarity Detection | MPQA | Accuracy | 89.3 | 154 |
| Sentiment Classification | MR | Accuracy | 76.5 | 148 |
| Sentiment Classification | IMDB (test) | Error Rate | 17.42 | 144 |
| Sentiment Classification | CR | Accuracy | 83.8 | 142 |
| Subjectivity Classification | Subj (test) | Accuracy | 93.6 | 125 |
| Question Classification | TREC (test) | Accuracy | 92.2 | 124 |