Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

A Conditional Splitting Framework for Efficient Constituency Parsing

About

We introduce a generic seq2seq parsing framework that casts constituency parsing problems (syntactic and discourse parsing) into a series of conditional splitting decisions. Our parsing model estimates the conditional probability distribution of possible splitting points in a given text span and supports efficient top-down decoding, which is linear in number of nodes. The conditional splitting formulation together with efficient beam search inference facilitate structural consistency without relying on expensive structured inference. Crucially, for discourse analysis we show that in our formulation, discourse segmentation can be framed as a special case of parsing which allows us to perform discourse parsing without requiring segmentation as a pre-requisite. Experiments show that our model achieves good results on the standard syntactic parsing tasks under settings with/without pre-trained representations and rivals state-of-the-art (SoTA) methods that are more computationally expensive than ours. In discourse parsing, our method outperforms SoTA by a good margin.

Thanh-Tung Nguyen, Xuan-Phi Nguyen, Shafiq Joty, Xiaoli Li• 2021

Related benchmarks

TaskDatasetResultRank
Constituent ParsingPTB (test)
F195.7
127
Phrase-structure parsingPTB (§23)
F1 Score95.7
56
Constituency ParsingPTB (test)
Speed (Sents/s)1.13e+3
12
Constituency ParsingPTB WSJ (Section 23 test)
F1 Score93.77
12
Discourse ParsingRST-DT (test)
Speedup44.5
11
Syntactic ParsingEnglish Penn Treebank (test)
Speed (Sents/s)1.13e+3
11
End-to-End Discourse ParsingRST-DT (test)
Span Score92.02
5
Discourse Parsing (with gold EDU segmentation)RST-DT (test)
Span Score97.37
5
Syntactic ParsingSPMRL 2014 (test)
Basque Language Score89.74
5
Showing 9 of 9 rows

Other info

Code

Follow for update