Joint Entity Linking with Deep Reinforcement Learning
About
Entity linking is the task of aligning mentions to corresponding entities in a given knowledge base. Previous studies have highlighted the necessity for entity linking systems to capture the global coherence. However, there are two common weaknesses in previous global models. First, most of them calculate the pairwise scores between all candidate entities and select the most relevant group of entities as the final result. In this process, the consistency among wrong entities as well as that among right ones are involved, which may introduce noise data and increase the model complexity. Second, the cues of previously disambiguated entities, which could contribute to the disambiguation of the subsequent mentions, are usually ignored by previous models. To address these problems, we convert the global linking into a sequence decision problem and propose a reinforcement learning model which makes decisions from a global perspective. Our model makes full use of the previous referred entities and explores the long-term influence of current selection on subsequent decisions. We conduct experiments on different types of datasets, the results show that our model outperforms state-of-the-art systems and has better generalization performance.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Entity Linking | MSNBC | Micro F192.8 | 36 | |
| Entity Disambiguation | AIDA CoNLL (test) | In-KB Accuracy94.3 | 36 | |
| Entity Linking | CWEB (test) | Micro F178.5 | 26 | |
| Named Entity Disambiguation | AIDA (test) | Micro InKB F194.3 | 25 | |
| Entity Disambiguation | Wiki (test) | Micro F182.8 | 24 | |
| Entity Disambiguation | ACE 2004 (test) | Micro F191.2 | 20 | |
| Entity Disambiguation | AQUAINT (AQ) (test) | Micro F187.5 | 20 | |
| Named Entity Disambiguation | MSNBC out-of-domain (test) | Micro F1 (InKB)92.8 | 18 | |
| Entity Disambiguation | Standard Entity Disambiguation Datasets (AIDA, MSNBC, AQUAINT, ACE2004, CWEB, WIKI) InKB (test) | AIDA Score94.3 | 15 | |
| Named Entity Disambiguation | ACE out-of-domain 2004 (test) | Micro F1 (InKB)91.2 | 13 |