Cross-lingual Entity Alignment via Joint Attribute-Preserving Embedding
About
Entity alignment is the task of finding entities in two knowledge bases (KBs) that represent the same real-world object. When facing KBs in different natural languages, conventional cross-lingual entity alignment methods rely on machine translation to eliminate the language barriers. These approaches often suffer from the uneven quality of translations between languages. While recent embedding-based techniques encode entities and relationships in KBs and do not need machine translation for cross-lingual entity alignment, a significant number of attributes remain largely unexplored. In this paper, we propose a joint attribute-preserving embedding model for cross-lingual entity alignment. It jointly embeds the structures of two KBs into a unified vector space and further refines it by leveraging attribute correlations in the KBs. Our experimental results on real-world datasets show that this approach significantly outperforms the state-of-the-art embedding approaches for cross-lingual entity alignment and could be complemented with methods based on machine translation.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Entity Alignment | DBP15K FR-EN | Hits@10.324 | 158 | |
| Entity Alignment | DBP15K JA-EN (test) | Hits@136.3 | 149 | |
| Entity Alignment | DBP15K ZH-EN | H@141.2 | 143 | |
| Entity Alignment | DBP15K ZH-EN (test) | Hits@141.2 | 134 | |
| Entity Alignment | DBP15K FR-EN (test) | Hits@132.4 | 133 | |
| Entity Alignment | DBP15K JA-EN | Hits@10.363 | 126 | |
| Entity Alignment | DWY100K DBP-YG | Hits@123.6 | 51 | |
| Entity Alignment | DBP ZH-EN 15K | Hits@10.412 | 47 | |
| Entity Alignment | DBP JA-EN 15K | Hits@136.3 | 40 | |
| Entity Alignment | DWY100K DBP-WD | Hits@10.318 | 36 |