Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

FLERT: Document-Level Features for Named Entity Recognition

About

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.

Stefan Schweter, Alan Akbik• 2020

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL English 2003 (test)
F1 Score94.09
135
Named Entity RecognitionCoNLL Spanish NER 2002 (test)
F1 Score90.14
98
Named Entity RecognitionCoNLL German 2003 (test)
F1 Score88.34
78
Named Entity RecognitionCoNLL Dutch 2002 original (test)
F1 Score95.21
9
Named Entity RecognitionCoNLL-2003 German 2006 Revised (test)
F1 Score92.23
8
Showing 5 of 5 rows

Other info

Follow for update