Training with Exploration Improves a Greedy Stack-LSTM Parser
About
We adapt the greedy Stack-LSTM dependency parser of Dyer et al. (2015) to support a training-with-exploration procedure using dynamic oracles(Goldberg and Nivre, 2013) instead of cross-entropy minimization. This form of training, which accounts for model predictions at training time rather than assuming an error-free action history, improves parsing accuracies for both English and Chinese, obtaining very strong results for both languages. We discuss some modifications needed in order to get training with exploration to work well for a probabilistic neural-network.
Miguel Ballesteros, Yoav Goldberg, Chris Dyer, Noah A. Smith• 2016
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Dependency Parsing | Chinese Treebank (CTB) (test) | UAS87.7 | 99 | |
| Dependency Parsing | English PTB Stanford Dependencies (test) | UAS93.6 | 76 | |
| Dependency Parsing | CoNLL German 2009 (test) | UAS90.34 | 25 | |
| Dependency Parsing | CoNLL Spanish 2009 (test) | UAS91.09 | 14 | |
| Dependency Parsing | CoNLL English 2009 (test) | UAS92.22 | 13 | |
| Dependency Parsing | CoNLL Chinese 2009 (test) | UAS83.54 | 12 | |
| Dependency Parsing | CoNLL Czech 2009 (test) | UAS85.68 | 12 | |
| Dependency Parsing | CoNLL Japanese 2009 (test) | UAS93.47 | 9 | |
| Parsing | English PTB-SD 3.3.0 (test) | UAS93.56 | 7 | |
| Dependency Parsing | CoNLL Catalan 2009 (test) | UAS90.45 | 6 |
Showing 10 of 11 rows