| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Semantic Textual Similarity | STS tasks (STS12, STS13, STS14, STS15, STS16, STS-B, SICK-R) | STS12 Score 80.67 | 195 |
| Semantic Textual Similarity | English STS | Average Score 83.07 | 68 |
| Semantic Textual Similarity | STS (Semantic Textual Similarity) 2012-2016 (test) | STS-12 Score 81.08 | 57 |
| Semantic Textual Similarity | STS 2014 | Spearman Correlation 0.8877 | 35 |
| Sentence Relatedness | STS 2014 | News Spearman 0.69 | 30 |
| Semantic Textual Similarity | STS-12 | Spearman Correlation (rho) 0.7154 | 23 |
| Privacy-utility tradeoff | STS12 | Leakage 4.34 | 16 |
| Semantic Textual Similarity | STS Benchmark (test) | Pearson Correlation (r) 0.919 | 16 |
| Semantic Textual Similarity | STS16 (test) | Spearman Corr 77.18 | 12 |
| Semantic Textual Similarity | STS15 (test) | Spearman Correlation 0.8049 | 12 |
| Semantic Textual Similarity | STS14 (test) | Spearman Correlation 0.7319 | 12 |
| Semantic Textual Similarity | STS13 (test) | Spearman Correlation 81.26 | 12 |
| Semantic Textual Similarity | STS-16 | Spearman Rho (x100) 77.63 | 11 |
| Semantic Textual Similarity | STS-15 | Spearman's Rho 0.7492 | 11 |
| Semantic Textual Similarity | STS-13 | Spearman's Rho 73.39 | 11 |
| Medical Image Segmentation | STS X-ray (unseen) | DSC 73.2 | 10 |
| Lung tumor segmentation | STS (test) | IoU 60.33 | 9 |
| Semantic Textual Similarity | STS English (test) | Spearman's ρ 76.9 | 9 |
| Semantic Textual Similarity | STS SemEval-2017 Task 1 (test) | Pearson Correlation 0.744 | 8 |
| Semantic Textual Similarity | STS12 | Downstream Performance 74.25 | 5 |
| Transfer Learning Evaluation | STS Transfer Robustness (test val) | MRPC 62.2 | 4 |
| Sentence Ranking | STS16 | KCC 58.9 | 3 |
| Sentence Ranking | STS15 | KCC 52 | 3 |
| Sentence Ranking | STS14 | KCC 44.53 | 3 |
| Sentence Ranking | STS13 | KCC 0.4626 | 3 |