A Semantic Distance Metric Learning approach for Lexical Semantic Change Detection
About
Detecting temporal semantic changes of words is an important task for various NLP applications that must make time-sensitive predictions. Lexical Semantic Change Detection (SCD) task involves predicting whether a given target word, $w$, changes its meaning between two different text corpora, $C_1$ and $C_2$. For this purpose, we propose a supervised two-staged SCD method that uses existing Word-in-Context (WiC) datasets. In the first stage, for a target word $w$, we learn two sense-aware encoders that represent the meaning of $w$ in a given sentence selected from a corpus. Next, in the second stage, we learn a sense-aware distance metric that compares the semantic representations of a target word across all of its occurrences in $C_1$ and $C_2$. Experimental results on multiple benchmark datasets for SCD show that our proposed method achieves strong performance in multiple languages. Additionally, our method achieves significant improvements on WiC benchmarks compared to a sense-aware encoder with conventional distance functions. Source code is available at https://github.com/LivNLP/svp-sdml .
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Lexical Semantic Change Detection | SemEval Task 1 Subtask 2 English 2020 | Spearman Correlation0.774 | 54 | |
| Lexical Semantic Change Detection | SemEval (test) | Accuracy (En)77.4 | 8 | |
| Lexical Semantic Change Detection | RuShiftEval (test) | Ru10.805 | 6 | |
| Word-in-Context (WiC) | XL-WiC all | Accuracy (De)84.9 | 6 | |
| Word-in-Context (WiC) | XL-WiC IV | Accuracy (De)85.1 | 6 | |
| Word-in-Context (WiC) | XL-WiC OOV | Accuracy (German)76.6 | 6 | |
| Word-in-Context (WiC) | MCL-WiC | Accuracy (En)90.3 | 4 | |
| Word-in-Context binary classification | AM2iCo (test) | Accuracy (De)85 | 4 |