Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Distantly Supervised Named Entity Recognition via Confidence-Based Multi-Class Positive and Unlabeled Learning

About

In this paper, we study the named entity recognition (NER) problem under distant supervision. Due to the incompleteness of the external dictionaries and/or knowledge bases, such distantly annotated training data usually suffer from a high false negative rate. To this end, we formulate the Distantly Supervised NER (DS-NER) problem via Multi-class Positive and Unlabeled (MPU) learning and propose a theoretically and practically novel CONFidence-based MPU (Conf-MPU) approach. To handle the incomplete annotations, Conf-MPU consists of two steps. First, a confidence score is estimated for each token of being an entity token. Then, the proposed Conf-MPU risk estimation is applied to train a multi-class classifier for the NER task. Thorough experiments on two benchmark datasets labeled by various external knowledge demonstrate the superiority of the proposed Conf-MPU over existing DS-NER methods.

Kang Zhou, Yuepei Li, Qi Li• 2022

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionBC5CDR (test)
Macro F1 (span-level)80.1
80
Named Entity RecognitionCoNLL KB-Matching 2003 (test)
F1 Score80.02
24
Named Entity RecognitionBC5CDR Big Dict (test)
F1 Score80.07
11
Named Entity RecognitionBC5CDR Small Dict (test)
F1 Score76.18
8
Named Entity RecognitionCoNLL Dict 2003 (test)
F1 Score83.34
8
Showing 5 of 5 rows

Other info

Follow for update