Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

An Analysis of Simple Data Augmentation for Named Entity Recognition

About

Simple yet effective data augmentation techniques have been proposed for sentence-level and sentence-pair natural language processing tasks. Inspired by these efforts, we design and compare data augmentation for named entity recognition, which is usually modeled as a token-level sequence labeling problem. Through experiments on two data sets from the biomedical and materials science domains (i2b2-2010 and MaSciP), we show that simple augmentation can boost performance for both recurrent and transformer-based models, especially for small training sets.

Xiang Dai, Heike Adel• 2020

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL 2003 (test)--
539
Named Entity RecognitionCoNLL 03
F1 (Entity)83.74
102
Named Entity RecognitionOntoNotes
F1-score62.67
91
Complex Named Entity RecognitionMultiCoNER (test)
Score (Bn)39.9
76
Named Entity RecognitionWNUT 2017 (test)
F1 Score52.29
63
Named Entity RecognitionMultiCoNER
F1 Score0.5467
48
Named Entity RecognitionNCBI
F1 Score78.97
26
Named Entity Recognitionbc2gm
Entity F160.46
21
Named Entity RecognitionTwitter NER
F1 Score73.69
14
Named Entity RecognitionCoNLL
F1 Score0.8508
10
Showing 10 of 16 rows

Other info

Follow for update