Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Semantic Textual Similarity on STS-16
Loading...
77.63
Spearman Rho (x100)
BERTlarge-flow (target)
36.446
47.138
57.83
68.522
Nov 2, 2020
Spearman Rho (x100)
Updated 4d ago
Evaluation Results
Method
Method
Links
Spearman Rho (x100)
BERTlarge-flow (target)
Backbone=BERT-large, P...
2020.11
77.63
BERTbase-flow (target)
Backbone=BERT-base, Po...
2020.11
75.37
BERTlarge-flow (NLI*)
Backbone=BERT-large, P...
2020.11
74.47
BERTbase-flow (NLI*)
Backbone=BERT-base, Po...
2020.11
71.84
BERTlarge-last2avg
Backbone=BERT-large, P...
2020.11
70.32
BERTbase-last2avg
Backbone=BERT-base, Po...
2020.11
69.81
BERTbase
Backbone=BERT-base, Po...
2020.11
65.19
Avg. GloVe embeddings
Backbone=GloVe, Poolin...
2020.11
63.66
BERTlarge
Backbone=BERT-large, P...
2020.11
61.63
Avg. BERT embeddings
Backbone=BERT, Pooling...
2020.11
61.06
BERT CLS-vector
Backbone=BERT, Pooling...
2020.11
38.03
Feedback
Search any
task
Search any
task