Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

STS

Benchmarks

Task NameDataset NameSOTA ResultTrend
Semantic Textual SimilaritySTS tasks (STS12, STS13, STS14, STS15, STS16, STS-B, SICK-R)
STS12 Score80.67
195
Semantic Textual SimilarityEnglish STS
Average Score83.07
68
Semantic Textual SimilaritySTS (Semantic Textual Similarity) 2012-2016 (test)
STS-12 Score81.08
57
Semantic Textual SimilaritySTS 2014
Spearman Correlation0.8877
35
Sentence RelatednessSTS 2014
News Spearman0.69
30
Semantic Textual SimilaritySTS-12
Spearman Correlation (rho)0.7154
23
Privacy-utility tradeoffSTS12
Leakage4.34
16
Semantic Textual SimilaritySTS Benchmark (test)
Pearson Correlation (r)0.919
16
Semantic Textual SimilaritySTS16 (test)
Spearman Corr77.18
12
Semantic Textual SimilaritySTS15 (test)
Spearman Correlation0.8049
12
Semantic Textual SimilaritySTS14 (test)
Spearman Correlation0.7319
12
Semantic Textual SimilaritySTS13 (test)
Spearman Correlation81.26
12
Semantic Textual SimilaritySTS-16
Spearman Rho (x100)77.63
11
Semantic Textual SimilaritySTS-15
Spearman's Rho0.7492
11
Semantic Textual SimilaritySTS-13
Spearman's Rho73.39
11
Medical Image SegmentationSTS X-ray (unseen)
DSC73.2
10
Lung tumor segmentationSTS (test)
IoU60.33
9
Semantic Textual SimilaritySTS English (test)
Spearman's ρ76.9
9
Semantic Textual SimilaritySTS SemEval-2017 Task 1 (test)
Pearson Correlation0.744
8
Semantic Textual SimilaritySTS12
Downstream Performance74.25
5
Transfer Learning EvaluationSTS Transfer Robustness (test val)
MRPC62.2
4
Sentence RankingSTS16
KCC58.9
3
Sentence RankingSTS15
KCC52
3
Sentence RankingSTS14
KCC44.53
3
Sentence RankingSTS13
KCC0.4626
3
Showing 25 of 26 rows