Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Fast and Accurate Entity Recognition with Iterated Dilated Convolutions

About

Today when many practitioners run basic NLP on the entire web and large-volume traffic, faster methods are paramount to saving time and energy costs. Recent advances in GPU hardware have led to the emergence of bi-directional LSTMs as a standard method for obtaining per-token vector representations serving as input to labeling tasks such as NER (often followed by prediction in a linear-chain CRF). Though expressive and accurate, these models fail to fully exploit GPU parallelism, limiting their computational efficiency. This paper proposes a faster alternative to Bi-LSTMs for NER: Iterated Dilated Convolutional Neural Networks (ID-CNNs), which have better capacity than traditional CNNs for large context and structured prediction. Unlike LSTMs whose sequential processing on sentences of length N requires O(N) time even in the face of parallelism, ID-CNNs permit fixed-depth convolutions to run in parallel across entire documents. We describe a distinct combination of network structure, parameter sharing and training procedures that enable dramatic 14-20x test-time speedups while retaining accuracy comparable to the Bi-LSTM-CRF. Moreover, ID-CNNs trained to aggregate context from the entire document are even more accurate while maintaining 8x faster test time speeds.

Emma Strubell, Patrick Verga, David Belanger, Andrew McCallum• 2017

Related benchmarks

TaskDatasetResultRank
Named Entity RecognitionCoNLL 2003 (test)
F1 Score90.85
539
Named Entity RecognitionCoNLL 03
F1 (Entity)90.54
102
Named Entity RecognitionOntoNotes
F1-score86.8
91
Named Entity RecognitionOntoNotes 5.0 (test)
F1 Score86.99
90
Named Entity RecognitionConll 2003
F1 Score90.54
86
Named Entity RecognitionOntoNotes 5.0
F1 Score86.84
79
Named Entity RecognitionWNUT 2017 (test)
F1 Score38.24
63
Named Entity RecognitionACE05
F1 Score86.99
38
Named Entity RecognitionOntoNotes (test)
F1 Score86.99
34
Named Entity RecognitionCoNLL (test)--
28
Showing 10 of 12 rows

Other info

Code

Follow for update