Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision

About

We study the open-domain named entity recognition (NER) problem under distant supervision. The distant supervision, though does not require large amounts of manual annotations, yields highly incomplete and noisy distant labels via external knowledge bases. To address this challenge, we propose a new computational framework -- BOND, which leverages the power of pre-trained language models (e.g., BERT and RoBERTa) to improve the prediction performance of NER models. Specifically, we propose a two-stage training algorithm: In the first stage, we adapt the pre-trained language model to the NER tasks using the distant labels, which can significantly improve the recall and precision; In the second stage, we drop the distant labels, and propose a self-training approach to further improve the model performance. Thorough experiments on 5 benchmark datasets demonstrate the superiority of BOND over existing distantly supervised NER methods. The code and distantly labeled data have been released in https://github.com/cliang1453/BOND.

Chen Liang, Yue Yu, Haoming Jiang, Siawpeng Er, Ruijia Wang, Tuo Zhao, Chao Zhang• 2020

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL 2003 (test)
F1 Score83.5
539
Named Entity RecognitionBC5CDR (test)
Macro F1 (span-level)81.1
80
Named Entity RecognitionBC5CDR
F1 Score83.18
59
Named Entity RecognitionNCBI-disease (test)
Precision70.8
40
Named Entity RecognitionNCBI-disease
F1 Score80.33
29
Named Entity RecognitionWNUT 2016 (test)
F1 Score42.1
26
Named Entity RecognitionCoNLL KB-Matching 2003 (test)
F1 Score79.83
24
Named Entity RecognitionLaptopReview
F1 Score67.19
12
Named Entity RecognitionCoNLL2003 String-Matching (test)
F1 Score75.51
11
Named Entity RecognitionBC5CDR Big Dict (test)
F1 Score73.66
11
Showing 10 of 19 rows

Other info

Follow for update