Are Negative Samples Necessary in Entity Alignment? An Approach with High Performance, Scalability and Robustness
About
Entity alignment (EA) aims to find the equivalent entities in different KGs, which is a crucial step in integrating multiple KGs. However, most existing EA methods have poor scalability and are unable to cope with large-scale datasets. We summarize three issues leading to such high time-space complexity in existing EA methods: (1) Inefficient graph encoders, (2) Dilemma of negative sampling, and (3) "Catastrophic forgetting" in semi-supervised learning. To address these challenges, we propose a novel EA method with three new components to enable high Performance, high Scalability, and high Robustness (PSR): (1) Simplified graph encoder with relational graph sampling, (2) Symmetric negative-free alignment loss, and (3) Incremental semi-supervised learning. Furthermore, we conduct detailed experiments on several public datasets to examine the effectiveness and efficiency of our proposed method. The experimental results show that PSR not only surpasses the previous SOTA in performance but also has impressive scalability and robustness.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Entity Alignment | DBP15K JA-EN (test) | Hits@190.8 | 149 | |
| Entity Alignment | DBP15K ZH-EN (test) | Hits@188.3 | 134 | |
| Entity Alignment | DBP15K FR-EN (test) | Hits@195.8 | 133 | |
| Entity Alignment | SRPRS | Time cost (s)75 | 59 | |
| Entity Alignment | DBP15K | Runtime (s)88 | 59 | |
| Entity Alignment | SRPRS FR-EN (test) | Hits@10.808 | 57 | |
| Entity Alignment | SRPRS DE-EN (test) | Hits@10.881 | 57 | |
| Entity Alignment | DWY100K | Runtime (s)603 | 44 | |
| Entity Alignment | DWYDBP-WD 100K (test) | H@188.1 | 20 | |
| Entity Alignment | DWYDBP-YG 100K (test) | H@10.892 | 20 |