Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

About

State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing. In this paper, we introduce a novel neutral network architecture that benefits from both word- and character-level representations automatically, by using combination of bidirectional LSTM, CNN and CRF. Our system is truly end-to-end, requiring no feature engineering or data pre-processing, thus making it applicable to a wide range of sequence labeling tasks. We evaluate our system on two data sets for two sequence labeling tasks --- Penn Treebank WSJ corpus for part-of-speech (POS) tagging and CoNLL 2003 corpus for named entity recognition (NER). We obtain state-of-the-art performance on both the two data --- 97.55\% accuracy for POS tagging and 91.21\% F1 for NER.

Xuezhe Ma, Eduard Hovy• 2016

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL 2003 (test)
F1 Score91.67
539
Named Entity RecognitionCoNLL English 2003 (test)
F1 Score91.21
135
Named Entity RecognitionCoNLL Spanish NER 2002 (test)
F1 Score85.91
98
Named Entity RecognitionOntoNotes
F1-score71.81
91
ChunkingCoNLL 2000 (test)
F1 Score95.93
88
Named Entity RecognitionCoNLL Dutch 2002 (test)
F1 Score86.69
87
Named Entity RecognitionConll 2003
F1 Score92.91
86
Named Entity RecognitionWnut 2017
F1 Score57.46
79
Named Entity RecognitionCoNLL German 2003 (test)
F1 Score78.15
78
Aspect-Term Sentiment AnalysisLAPTOP SemEval 2014 (test)
Macro-F154.71
69
Showing 10 of 39 rows

Other info

Follow for update