Fast and Accurate Entity Recognition with Iterated Dilated Convolutions
About
Today when many practitioners run basic NLP on the entire web and large-volume traffic, faster methods are paramount to saving time and energy costs. Recent advances in GPU hardware have led to the emergence of bi-directional LSTMs as a standard method for obtaining per-token vector representations serving as input to labeling tasks such as NER (often followed by prediction in a linear-chain CRF). Though expressive and accurate, these models fail to fully exploit GPU parallelism, limiting their computational efficiency. This paper proposes a faster alternative to Bi-LSTMs for NER: Iterated Dilated Convolutional Neural Networks (ID-CNNs), which have better capacity than traditional CNNs for large context and structured prediction. Unlike LSTMs whose sequential processing on sentences of length N requires O(N) time even in the face of parallelism, ID-CNNs permit fixed-depth convolutions to run in parallel across entire documents. We describe a distinct combination of network structure, parameter sharing and training procedures that enable dramatic 14-20x test-time speedups while retaining accuracy comparable to the Bi-LSTM-CRF. Moreover, ID-CNNs trained to aggregate context from the entire document are even more accurate while maintaining 8x faster test time speeds.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Named Entity Recognition | CoNLL 2003 (test) | F1 Score90.85 | 539 | |
| Named Entity Recognition | CoNLL 03 | F1 (Entity)90.54 | 102 | |
| Named Entity Recognition | OntoNotes | F1-score86.8 | 91 | |
| Named Entity Recognition | OntoNotes 5.0 (test) | F1 Score86.99 | 90 | |
| Named Entity Recognition | Conll 2003 | F1 Score90.54 | 86 | |
| Named Entity Recognition | OntoNotes 5.0 | F1 Score86.84 | 79 | |
| Named Entity Recognition | WNUT 2017 (test) | F1 Score38.24 | 63 | |
| Named Entity Recognition | ACE05 | F1 Score86.99 | 38 | |
| Named Entity Recognition | OntoNotes (test) | F1 Score86.99 | 34 | |
| Named Entity Recognition | CoNLL (test) | -- | 28 |