Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Semantic Textual Similarity on STS-B (Leakage, Confidence, Pearson evaluation)

2.68Leakage

SPARSE

-0.682822.016144.71567.4139Feb 6, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
2.681.780.4817
2026.02
19.031.980.4005
2026.02
20.751.890.4003
2026.02
32.398.070.7614
2026.02
49.9513.970.7185
2026.02
53.7915.40.7187
2026.02
64.7936.330.81
2026.02
69.1538.440.8095
2026.02
71.8241.760.8095
2026.02
73.148.630.8091
2026.02
74.9849.280.8108
2026.02
76.1552.360.8108
2026.02
76.4253.980.8081
2026.02
77.454.020.8095
2026.02
78.9456.840.8095
2026.02
86.7566.570.8064