Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

CAN-NER: Convolutional Attention Network for Chinese Named Entity Recognition

About

Named entity recognition (NER) in Chinese is essential but difficult because of the lack of natural delimiters. Therefore, Chinese Word Segmentation (CWS) is usually considered as the first step for Chinese NER. However, models based on word-level embeddings and lexicon features often suffer from segmentation errors and out-of-vocabulary (OOV) words. In this paper, we investigate a Convolutional Attention Network called CAN for Chinese NER, which consists of a character-based convolutional neural network (CNN) with local-attention layer and a gated recurrent unit (GRU) with global self-attention layer to capture the information from adjacent characters and sentence contexts. Also, compared to other models, not depending on any external resources like lexicons and employing small size of char embeddings make our model more practical. Extensive experimental results show that our approach outperforms state-of-the-art methods without word embedding and external lexicon resources on different domain datasets including Weibo, MSRA and Chinese Resume NER dataset.

Yuying Zhu, Guoxin Wang, B\"orje F. Karlsson• 2019

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionMSRA (test)
F1 Score92.97
63
Named Entity RecognitionOntoNotes 4.0 (test)
F1 Score73.64
55
Named Entity RecognitionRESUME
F1 Score94.94
52
Named Entity RecognitionWeibo (test)
Overall Score59.31
50
Named Entity RecognitionMSRA
F1 Score92.97
29
Named Entity RecognitionResume (test)
F1 Score94.94
28
Named Entity RecognitionWeibo
F1 Score59.31
27
Named Entity RecognitionOntoNotes 4.0
F1 Score73.64
18
Named Entity RecognitionWeibo (WE)
F1 Score59.31
13
Named Entity RecognitionChinese Resume (test)
F1 Score94.94
12
Showing 10 of 14 rows

Other info

Follow for update