Enhanced Meta-Learning for Cross-lingual Named Entity Recognition with Minimal Resources

About

For languages with no annotated resources, transferring knowledge from rich-resource languages is an effective solution for named entity recognition (NER). While all existing methods directly transfer from source-learned model to a target language, in this paper, we propose to fine-tune the learned model with a few similar examples given a test case, which could benefit the prediction by leveraging the structural and semantic information conveyed in such similar examples. To this end, we present a meta-learning algorithm to find a good model parameter initialization that could fast adapt to the given test case and propose to construct multiple pseudo-NER tasks for meta-training by computing sentence similarities. To further improve the model's generalization ability across different languages, we introduce a masking scheme and augment the loss function with an additional maximum term during meta-training. We conduct extensive experiments on cross-lingual named entity recognition with minimal resources over five target languages. The results show that our approach significantly outperforms existing state-of-the-art methods across the board.

Qianhui Wu, Zijia Lin, Guoxin Wang, Hui Chen, B\"orje F. Karlsson, Biqing Huang, Chin-Yew Lin• 2019

Related benchmarks

Task	Dataset	Result
Named Entity Recognition	CoNLL Spanish NER 2002 (test)	F1 Score76.75	98
Named Entity Recognition	CoNLL Dutch 2002 (test)	F1 Score80.44	87
Named Entity Recognition	CoNLL German 2003 (test)	F1 Score73.16	78
Named Entity Recognition	CoNLL de 2003 (test)	F1 Score73.16	12
Named Entity Recognition	CoNLL-2002 (es, nl), CoNLL-2003 (de), Europeana Newspapers (fr), MSRA (zh) (test)	F1 Score (es)76.75	8

Showing 5 of 5 rows

Other info

Code

Follow for update

@wizwand_team Discord