Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniTrans: Unifying Model Transfer and Data Transfer for Cross-Lingual Named Entity Recognition with Unlabeled Data

About

Prior works in cross-lingual named entity recognition (NER) with no/little labeled data fall into two primary categories: model transfer based and data transfer based methods. In this paper we find that both method types can complement each other, in the sense that, the former can exploit context information via language-independent features but sees no task-specific information in the target language; while the latter generally generates pseudo target-language training data via translation but its exploitation of context information is weakened by inaccurate translations. Moreover, prior works rarely leverage unlabeled data in the target language, which can be effortlessly collected and potentially contains valuable information for improved results. To handle both problems, we propose a novel approach termed UniTrans to Unify both model and data Transfer for cross-lingual NER, and furthermore, to leverage the available information from unlabeled target-language data via enhanced knowledge distillation. We evaluate our proposed UniTrans over 4 target languages on benchmark datasets. Our experimental results show that it substantially outperforms the existing state-of-the-art methods.

Qianhui Wu, Zijia Lin, B\"orje F. Karlsson, Biqing Huang, Jian-Guang Lou• 2020

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL Spanish NER 2002 (test)
F1 Score79.31
98
Named Entity RecognitionCoNLL Dutch 2002 (test)
F1 Score82.9
87
Named Entity RecognitionCoNLL German 2003 (test)
F1 Score73.22
78
Named Entity RecognitionCoNLL NER 2002/2003 (test)
German F1 Score73.61
59
Named Entity RecognitionWikiAnn (test)
Average Accuracy55.91
58
Named Entity RecognitionCoNLL (test)
F1 Score (de)74.82
28
Named Entity RecognitionCoNLL de 2003 (test)
F1 Score74.82
12
Named Entity RecognitionNoDaLiDa no 2019 (test)
F1 Score81.17
3
Showing 8 of 8 rows

Other info

Follow for update