Global Entity Disambiguation with BERT

About

We propose a global entity disambiguation (ED) model based on BERT. To capture global contextual information for ED, our model treats not only words but also entities as input tokens, and solves the task by sequentially resolving mentions to their referent entities and using resolved entities as inputs at each step. We train the model using a large entity-annotated corpus obtained from Wikipedia. We achieve new state-of-the-art results on five standard ED datasets: AIDA-CoNLL, MSNBC, AQUAINT, ACE2004, and WNED-WIKI. The source code and model checkpoint are available at https://github.com/studio-ousia/luke.

Ikuya Yamada, Koki Washio, Hiroyuki Shindo, Yuji Matsumoto• 2019

Related benchmarks

Task	Dataset	Result
Entity Disambiguation	AIDA CoNLL (test)	In-KB Accuracy97.1	36
Entity Disambiguation	ZELDA Benchmark (test)	AIDA-B81.2	35
Entity Linking	CWEB (test)	Micro F178.9	26
Entity Disambiguation	Wiki (test)	Micro F189.2	24
Entity Disambiguation	AQUAINT (AQ) (test)	Micro F193.5	20
Entity Disambiguation	ACE 2004 (test)	Micro F191.9	20
Entity Disambiguation	MSNBC (test)	Micro F196.3	13

Showing 7 of 7 rows

Other info

Code

Follow for update

@wizwand_team Discord