FLERT: Document-Level Features for Named Entity Recognition
About
Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Named Entity Recognition | CoNLL English 2003 (test) | F1 Score94.09 | 135 | |
| Named Entity Recognition | CoNLL Spanish NER 2002 (test) | F1 Score90.14 | 98 | |
| Named Entity Recognition | CoNLL German 2003 (test) | F1 Score88.34 | 78 | |
| Named Entity Recognition | CoNLL Dutch 2002 original (test) | F1 Score95.21 | 9 | |
| Named Entity Recognition | CoNLL-2003 German 2006 Revised (test) | F1 Score92.23 | 8 |