Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Adversarial Propagation and Zero-Shot Cross-Lingual Transfer of Word Vector Specialization

About

Semantic specialization is the process of fine-tuning pre-trained distributional word vectors using external lexical knowledge (e.g., WordNet) to accentuate a particular semantic relation in the specialized vector space. While post-processing specialization methods are applicable to arbitrary distributional vectors, they are limited to updating only the vectors of words occurring in external lexicons (i.e., seen words), leaving the vectors of all other words unchanged. We propose a novel approach to specializing the full distributional vocabulary. Our adversarial post-specialization method propagates the external lexical knowledge to the full distributional space. We exploit words seen in the resources as training examples for learning a global specialization function. This function is learned by combining a standard L2-distance loss with an adversarial loss: the adversarial component produces more realistic output vectors. We show the effectiveness and robustness of the proposed method across three languages and on three tasks: word similarity, dialog state tracking, and lexical simplification. We report consistent improvements over distributional word vectors and vectors specialized by other state-of-the-art specialization frameworks. Finally, we also propose a cross-lingual transfer method for zero-shot specialization which successfully specializes a full target distributional space without any lexical knowledge in the target language and without any bilingual data.

Edoardo Maria Ponti, Ivan Vuli\'c, Goran Glava\v{s}, Nikola Mrk\v{s}i\'c, Anna Korhonen• 2018

Related benchmarks

TaskDatasetResultRank
Dialogue State TrackingWOZ 2.0 (test)
Joint Goal Accuracy83.6
65
Word SimilaritySimLex999 (test)
Spearman Correlation0.789
30
Word SimilaritySimVerb-3500 (test)
Spearman Correlation0.764
27
Lexical Text SimplificationLS dataset standard (test)
Accuracy73.9
12
Word SimilarityEnglish SimLex-999 Disjoint setting (test)
Spearman's Rho0.652
12
Word SimilarityEnglish SimVerb-3500 Disjoint setting (test)
Spearman's Rho0.552
12
Dialog State TrackingWOZ Italian (IT) (test)
JGA71.4
2
Lexical SimplificationSIMPITIKI Italian (IT) (test)
Accuracy39.2
2
Word SimilaritySimLex-999 Italian (IT) (test)
Spearman's Rho0.431
2
Word SimilaritySimLex-999 German (DE) (test)
Spearman's Rho0.525
2
Showing 10 of 10 rows

Other info

Code

Follow for update