FLERT: Document-Level Features for Named Entity Recognition

About

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.

Stefan Schweter, Alan Akbik• 2020

Related benchmarks

Task	Dataset	Result
Named Entity Recognition	CoNLL English 2003 (test)	F1 Score94.09	135
Named Entity Recognition	CoNLL Spanish NER 2002 (test)	F1 Score90.14	98
Named Entity Recognition	CoNLL German 2003 (test)	F1 Score88.34	78
Named Entity Recognition	CoNLL Dutch 2002 original (test)	F1 Score95.21	9
Named Entity Recognition	CoNLL-2003 German 2006 Revised (test)	F1 Score92.23	8

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord