
MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic Perspective

About

NER models have achieved promising performance on standard NER benchmarks. However, recent studies show that previous approaches may over-rely on entity mention information, resulting in poor performance on out-of-vocabulary (OOV) entity recognition. In this work, we propose MINER, a novel NER learning framework, to remedy this issue from an information-theoretic perspective. The proposed approach contains two mutual information-based training objectives: (i) generalizing information maximization, which enhances the representation through a deep understanding of context and entity surface forms; and (ii) superfluous information minimization, which discourages the representation from rote-memorizing entity names or exploiting biased cues in the data. Experiments across various settings and datasets demonstrate that MINER achieves better performance in predicting OOV entities.

Xiao Wang, Shihan Dou, Limao Xiong, Yicheng Zou, Qi Zhang, Tao Gui, Liang Qiao, Zhanzhan Cheng, Xuanjing Huang • 2022
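
The abstract names a mutual-information maximization objective but does not say which estimator is used. As an illustration only, the sketch below uses InfoNCE, a common contrastive lower bound on mutual information, as a stand-in. The function names `info_nce` and `mi_lower_bound`, the cosine-similarity critic, and the `temperature` value are assumptions made for this example and are not taken from the paper.

```python
import numpy as np

def info_nce(z_a, z_b, temperature=0.1):
    """InfoNCE loss for two batches of paired representations, shape (n, d).

    Row i of z_a and row i of z_b are treated as a positive pair;
    every other row of z_b serves as a negative for row i of z_a.
    """
    # L2-normalise so the dot product below is a cosine similarity.
    z_a = z_a / np.linalg.norm(z_a, axis=1, keepdims=True)
    z_b = z_b / np.linalg.norm(z_b, axis=1, keepdims=True)
    logits = z_a @ z_b.T / temperature  # (n, n) similarity scores
    # Numerically stable row-wise log-softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Positive pairs sit on the diagonal.
    return -np.mean(np.diag(log_probs))

def mi_lower_bound(z_a, z_b, temperature=0.1):
    """InfoNCE bound: I(A; B) >= log(n) - loss, so it never exceeds log(n)."""
    n = z_a.shape[0]
    return np.log(n) - info_nce(z_a, z_b, temperature)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    z = rng.normal(size=(8, 16))
    noisy_view = z + 0.01 * rng.normal(size=z.shape)  # nearly identical second view
    unrelated = rng.normal(size=z.shape)              # independent second view
    print("paired views   :", info_nce(z, noisy_view))
    print("unrelated views:", info_nce(z, unrelated))
```

Minimizing this loss pulls paired representations together and pushes unpaired ones apart, which is one way to maximize a lower bound on the information two views share. How MINER constructs its views, and how it implements the superfluous information minimization term, is not described in the abstract and is not covered by this sketch.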

Related benchmarks

Task                      Dataset             Result          Rank
Named Entity Recognition  CoNLL 2003 (test)   --              539
Named Entity Recognition  WNUT 2017 (test)    F1 Score 54.86  63
Named Entity Recognition  TwitterNER (test)   F1 Score 75.38  10
Named Entity Recognition  BioNER (test)       F1 Score 77.03  10
