Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Multilingual Autoregressive Entity Linking

About

We present mGENRE, a sequence-to-sequence system for the Multilingual Entity Linking (MEL) problem -- the task of resolving language-specific mentions to a multilingual Knowledge Base (KB). For a mention in a given language, mGENRE predicts the name of the target entity left-to-right, token-by-token in an autoregressive fashion. The autoregressive formulation allows us to effectively cross-encode mention string and entity names to capture more interactions than the standard dot product between mention and entity vectors. It also enables fast search within a large KB even for mentions that do not appear in mention tables and with no need for large-scale vector indices. While prior MEL works use a single representation for each entity, we match against entity names of as many languages as possible, which allows exploiting language connections between source input and target name. Moreover, in a zero-shot setting on languages with no training data at all, mGENRE treats the target language as a latent variable that is marginalized at prediction time. This leads to over 50% improvements in average accuracy. We show the efficacy of our approach through extensive evaluation including experiments on three popular MEL benchmarks where mGENRE establishes new state-of-the-art results. Code and pre-trained models at https://github.com/facebookresearch/GENRE.

Nicola De Cao, Ledell Wu, Kashyap Popat, Mikel Artetxe, Naman Goyal, Mikhail Plekhanov, Luke Zettlemoyer, Nicola Cancedda, Sebastian Riedel, Fabio Petroni• 2021

Related benchmarks

TaskDatasetResultRank
End-to-end Entity LinkingENEIDE DZ
Precision55.2
12
End-to-end Entity LinkingAMD ENEIDE
Precision64.83
12
Cross-lingual Biomedical Entity LinkingXL-BEL
EN Score85.2
10
Biomedical Entity LinkingEMEA
Score (ES)56.5
10
Biomedical Entity LinkingPatent
FR Score61.1
10
Biomedical Entity LinkingWikiMed DE
DE Score31.8
9
Entity LinkingTAC-KBP FreeBase linked 2015
Accuracy (es)86.9
8
Entity LinkingMHERCL
F1 (en)47
8
Entity LinkingMewsli-9 (test)
Recall@190.6
7
Entity LinkingTR hard 2016
Accuracy (de)61.8
7
Showing 10 of 14 rows

Other info

Code

Follow for update