Lexicon Infused Phrase Embeddings for Named Entity Resolution

About

Most state-of-the-art approaches for named-entity recognition (NER) use semi supervised information in the form of word clusters and lexicons. Recently neural network-based language models have been explored, as they as a byproduct generate highly informative vector representations for words, known as word embeddings. In this paper we present two contributions: a new form of learning word embeddings that can leverage information from relevant lexicons to improve the representations, and the first system to use neural word embeddings to achieve state-of-the-art results on named-entity recognition in both CoNLL and Ontonotes NER. Our system achieves an F1 score of 90.90 on the test set for CoNLL 2003---significantly better than any previous system trained on public data, and matching a system employing massive private industrial query-log data.

Alexandre Passos, Vineet Kumar, Andrew McCallum• 2014

Related benchmarks

Task	Dataset	Result
Named Entity Recognition	CoNLL 2003 (test)	F1 Score90.9	556
Named Entity Recognition	CoNLL English 2003 (test)	F1 Score90.9	135
Named Entity Recognition	OntoNotes 5.0	F1 Score82.3	90
Named Entity Recognition	OntoNotes 5.0 (test)	F1 Score82.24	90
Named Entity Recognition	Conll 2003	F1 Score90.9	86
Named Entity Recognition	OntoNotes (test)	F1 Score82.24	34
Named Entity Recognition	CoNLL English 2003 (dev)	F1 Score94.46	26

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord