End-to-end Sequence Labeling via Bi-directional LSTM-CNNs-CRF

About

State-of-the-art sequence labeling systems traditionally require large amounts of task-specific knowledge in the form of hand-crafted features and data pre-processing. In this paper, we introduce a novel neutral network architecture that benefits from both word- and character-level representations automatically, by using combination of bidirectional LSTM, CNN and CRF. Our system is truly end-to-end, requiring no feature engineering or data pre-processing, thus making it applicable to a wide range of sequence labeling tasks. We evaluate our system on two data sets for two sequence labeling tasks --- Penn Treebank WSJ corpus for part-of-speech (POS) tagging and CoNLL 2003 corpus for named entity recognition (NER). We obtain state-of-the-art performance on both the two data --- 97.55\% accuracy for POS tagging and 91.21\% F1 for NER.

Xuezhe Ma, Eduard Hovy• 2016

Related benchmarks

Task	Dataset	Result
Named Entity Recognition	CoNLL 2003 (test)	F1 Score91.67	556
Named Entity Recognition	CoNLL English 2003 (test)	F1 Score91.21	135
Named Entity Recognition	OntoNotes	F1-score71.81	121
Named Entity Recognition	CoNLL Spanish NER 2002 (test)	F1 Score85.91	98
Named Entity Recognition	Wnut 2017	F1 Score57.46	91
Chunking	CoNLL 2000 (test)	F1 Score95.93	88
Named Entity Recognition	CoNLL Dutch 2002 (test)	F1 Score86.69	87
Named Entity Recognition	Conll 2003	F1 Score92.91	86
Named Entity Recognition	CoNLL German 2003 (test)	F1 Score78.15	78
Aspect-Term Sentiment Analysis	LAPTOP SemEval 2014 (test)	Macro-F154.71	69

Showing 10 of 39 rows

Other info

Follow for update

@wizwand_team Discord