Training with Exploration Improves a Greedy Stack-LSTM Parser

About

We adapt the greedy Stack-LSTM dependency parser of Dyer et al. (2015) to support a training-with-exploration procedure using dynamic oracles(Goldberg and Nivre, 2013) instead of cross-entropy minimization. This form of training, which accounts for model predictions at training time rather than assuming an error-free action history, improves parsing accuracies for both English and Chinese, obtaining very strong results for both languages. We discuss some modifications needed in order to get training with exploration to work well for a probabilistic neural-network.

Miguel Ballesteros, Yoav Goldberg, Chris Dyer, Noah A. Smith• 2016

Related benchmarks

Task	Dataset	Result
Dependency Parsing	Chinese Treebank (CTB) (test)	UAS87.7	99
Dependency Parsing	English PTB Stanford Dependencies (test)	UAS93.6	76
Dependency Parsing	CoNLL German 2009 (test)	UAS90.34	25
Dependency Parsing	CoNLL Spanish 2009 (test)	UAS91.09	14
Dependency Parsing	CoNLL English 2009 (test)	UAS92.22	13
Dependency Parsing	CoNLL Chinese 2009 (test)	UAS83.54	12
Dependency Parsing	CoNLL Czech 2009 (test)	UAS85.68	12
Dependency Parsing	CoNLL Japanese 2009 (test)	UAS93.47	9
Parsing	English PTB-SD 3.3.0 (test)	UAS93.56	7
Dependency Parsing	CoNLL Catalan 2009 (test)	UAS90.45	6

Showing 10 of 11 rows

Other info

Follow for update

@wizwand_team Discord