Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Efficient Constituency Parsing by Pointing

About

We propose a novel constituency parsing model that casts the parsing problem into a series of pointing tasks. Specifically, our model estimates the likelihood of a span being a legitimate tree constituent via the pointing score corresponding to the boundary words of the span. Our parsing model supports efficient top-down decoding and our learning objective is able to enforce structural consistency without resorting to the expensive CKY inference. The experiments on the standard English Penn Treebank parsing task show that our method achieves 92.78 F1 without using pre-trained models, which is higher than all the existing methods with similar time complexity. Using pre-trained BERT, our model achieves 95.48 F1, which is competitive with the state-of-the-art while being faster. Our approach also establishes new state-of-the-art in Basque and Swedish in the SPMRL shared tasks on multilingual constituency parsing.

Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty, Xiaoli Li• 2020

Related benchmarks

TaskDatasetResultRank
Constituent ParsingPTB (test)
F195.48
127
Phrase-structure parsingPTB (§23)
F1 Score95.5
56
Multilingual Constituency ParsingSPMRL 2013 2014 (test)
French Score86.69
13
Constituency ParsingPTB WSJ (Section 23 test)
F1 Score92.78
12
Constituency ParsingPTB (test)
Speed (Sents/s)130
12
Syntactic ParsingEnglish Penn Treebank (test)
Speed (Sents/s)130
11
Syntactic ParsingSPMRL 2014 (test)
Basque Language Score90.23
5
Showing 7 of 7 rows

Other info

Follow for update