Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

SemEval-2017 Task 1: Semantic Textual Similarity - Multilingual and Cross-lingual Focused Evaluation

About

Semantic Textual Similarity (STS) measures the meaning similarity of sentences. Applications include machine translation (MT), summarization, generation, question answering (QA), short answer grading, semantic search, dialog and conversational systems. The STS shared task is a venue for assessing the current state-of-the-art. The 2017 task focuses on multilingual and cross-lingual pairs with one sub-track exploring MT quality estimation (MTQE) data. The task obtained strong participation from 31 teams, with 17 participating in all language tracks. We summarize performance and review a selection of well performing methods. Analysis highlights common errors, providing insight into the limitations of existing models. To support ongoing work on semantic representations, the STS Benchmark is introduced as a new shared training and evaluation set carefully selected from the corpus of English STS shared task data (2012-2017).

Daniel Cer, Mona Diab, Eneko Agirre, I\~nigo Lopez-Gazpio, Lucia Specia• 2017

Related benchmarks

TaskDatasetResultRank
Sentiment AnalysisSST5
Spearman's rho (x100)36.75
23
Question ClassificationTREC
Spearman's rho (x100)34.64
23
Sentiment AnalysisMR
Spearman's rho0.4603
23
Sentiment AnalysisSST2
Spearman Rho (x100)40.41
23
Citation Intent ClassificationSciCite
Spearman Correlation0.3299
23
Opinion Polarity DetectionMPQA--
12
Sentiment AnalysisCR
Spearman Correlation52.78
11
Semantic Textual SimilaritySTS SemEval-2017 Task 1 (test)
Pearson Correlation0.6045
8
Semantic Textual SimilaritySemEval Task 1 Spanish 2017 (Track 3)
Pearson R (x100)71.17
8
Semantic Textual SimilaritySemEval Task 1 English Track 5 2017
Pearson Correlation (R)0.7278
8
Showing 10 of 10 rows

Other info

Follow for update