Span-Based Constituency Parsing with a Structure-Label System and Provably Optimal Dynamic Oracles
About
Parsing accuracy using efficient greedy transition systems has improved dramatically in recent years thanks to neural networks. Despite striking results in dependency parsing, however, neural models have not surpassed state-of-the-art approaches in constituency parsing. To remedy this, we introduce a new shift-reduce system whose stack contains merely sentence spans, represented by a bare minimum of LSTM features. We also design the first provably optimal dynamic oracle for constituency parsing, which runs in amortized O(1) time, compared to O(n^3) oracles for standard dependency parsing. Training with this oracle, we achieve the best F1 scores on both English and French of any parser that does not use reranking or external data.
James Cross, Liang Huang• 2016
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Constituent Parsing | PTB (test) | F191.3 | 127 | |
| Constituency Parsing | Penn Treebank WSJ (section 23 test) | F1 Score91.3 | 55 | |
| Constituency Parsing | WSJ Penn Treebank (test) | F1 Score91.3 | 27 | |
| Multilingual Constituency Parsing | SPMRL 2013 2014 (test) | French Score83.31 | 13 |
Showing 4 of 4 rows