End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF
About
State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing. In this paper, we introduce a novel neutral network architecture that benefits from both word- and character-level representations automatically, by using combination of bidirectional LSTM, CNN and CRF. Our system is truly end-to-end, requiring no feature engineering or data pre-processing, thus making it applicable to a wide range of sequence labeling tasks. We evaluate our system on two data sets for two sequence labeling tasks --- Penn Treebank WSJ corpus for part-of-speech (POS) tagging and CoNLL 2003 corpus for named entity recognition (NER). We obtain state-of-the-art performance on both the two data --- 97.55\% accuracy for POS tagging and 91.21\% F1 for NER.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Named Entity Recognition | CoNLL 2003 (test) | F1 Score91.67 | 539 | |
| Named Entity Recognition | CoNLL English 2003 (test) | F1 Score91.21 | 135 | |
| Named Entity Recognition | CoNLL Spanish NER 2002 (test) | F1 Score85.91 | 98 | |
| Named Entity Recognition | OntoNotes | F1-score71.81 | 91 | |
| Chunking | CoNLL 2000 (test) | F1 Score95.93 | 88 | |
| Named Entity Recognition | CoNLL Dutch 2002 (test) | F1 Score86.69 | 87 | |
| Named Entity Recognition | Conll 2003 | F1 Score92.91 | 86 | |
| Named Entity Recognition | Wnut 2017 | F1 Score57.46 | 79 | |
| Named Entity Recognition | CoNLL German 2003 (test) | F1 Score78.15 | 78 | |
| Aspect-Term Sentiment Analysis | LAPTOP SemEval 2014 (test) | Macro-F154.71 | 69 |