Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Universal Sentence Encoder

About

We present models for encoding sentences into embedding vectors that specifically target transfer learning to other NLP tasks. The models are efficient and result in accurate performance on diverse transfer tasks. Two variants of the encoding models allow for trade-offs between accuracy and compute resources. For both variants, we investigate and report the relationship between model complexity, resource consumption, the availability of transfer task training data, and task performance. Comparisons are made with baselines that use word level transfer learning via pretrained word embeddings as well as baselines do not use any transfer learning. We find that transfer learning using sentence embeddings tends to outperform word level transfer. With transfer learning via sentence embeddings, we observe surprisingly good performance with minimal amounts of supervised training data for a transfer task. We obtain encouraging results on Word Embedding Association Tests (WEAT) targeted at detecting model bias. Our pre-trained sentence encoding models are made freely available for download and on TF Hub.

Daniel Cer, Yinfei Yang, Sheng-yi Kong, Nan Hua, Nicole Limtiaco, Rhomni St. John, Noah Constant, Mario Guajardo-Cespedes, Steve Yuan, Chris Tar, Yun-Hsuan Sung, Brian Strope, Ray Kurzweil• 2018

Related benchmarks

TaskDatasetResultRank
Semantic Textual SimilaritySTS tasks (STS12, STS13, STS14, STS15, STS16, STS-B, SICK-R) various (test)
STS12 Score64.49
393
Subjectivity ClassificationSubj
Accuracy93.9
266
Question ClassificationTREC
Accuracy98.07
205
Opinion Polarity DetectionMPQA
Accuracy88.14
154
Sentiment ClassificationMR
Accuracy81.59
148
Sentiment ClassificationCR
Accuracy87.45
142
Semantic Textual SimilaritySTS 2014
Spearman Correlation0.7492
35
Sentence Representation EvaluationSentEval (test)
MR Accuracy80.09
28
Sentiment ClassificationSST
Accuracy87.21
24
Semantic Textual SimilaritySTS Benchmark (dev)
Pearson Correlation (r)0.814
21
Showing 10 of 13 rows

Other info

Code

Follow for update