Universal Sentence Encoder

About

We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word-level transfer learning via pretrained word embeddings as well as baselines that do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word-level transfer. With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.

Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil • 2018

Related benchmarks

Task	Dataset	Result
Semantic Textual Similarity	STS tasks (STS12, STS13, STS14, STS15, STS16, STS-B, SICK-R) various (test)	STS12 Score 64.49	412
Subjectivity Classification	Subj	Accuracy 93.9	343
Question Classification	TREC	Accuracy 98.07	262
Opinion Polarity Detection	MPQA	Accuracy 88.14	158
Sentiment Classification	MR	Accuracy 81.59	148
Sentiment Classification	CR	Accuracy 87.45	142
Semantic Textual Similarity	STS Benchmark (test)	Pearson Correlation (r) 0.782	46
Semantic Textual Similarity	STS 2014	Spearman Correlation 0.7492	39
Sentence Representation Evaluation	SentEval (test)	MR Accuracy 80.09	28
Sentiment Classification	SST	Accuracy 87.21	24
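
The Semantic Textual Similarity rows above report correlation between model-predicted similarity and human judgments. As a rough illustration of how such a score can be computed, the sketch below scores each sentence pair by the cosine similarity of its two embedding vectors and then takes the Pearson correlation against gold ratings. The embedding vectors and gold scores are made-up toy values, not outputs of the actual encoder, and cosine similarity is an assumed choice of similarity function; the listing does not say how the reported numbers were produced.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical 3-dimensional "sentence embeddings" for three sentence pairs.
# A real encoder would produce much higher-dimensional vectors.
pairs = [
    ([1.0, 0.0, 0.0], [0.9, 0.1, 0.0]),  # near-paraphrases
    ([0.0, 1.0, 0.0], [0.0, 0.6, 0.8]),  # partially related
    ([0.0, 0.0, 1.0], [1.0, 0.0, 0.0]),  # unrelated
]

# Hypothetical human similarity ratings on a 0-5 scale, one per pair.
gold_scores = [4.8, 3.0, 0.2]

predicted = [cosine_similarity(u, v) for u, v in pairs]
r = pearson_r(predicted, gold_scores)
print(f"Pearson r = {r:.3f}")
```

Pearson correlation measures how linearly the predicted similarities track the gold ratings. The Spearman variant listed for STS 2014 would apply the same formula to the ranks of the two sequences rather than their raw values.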
