Span-based Semantic Parsing for Compositional Generalization

About

Despite the success of sequence-to-sequence (seq2seq) models in semantic parsing, recent work has shown that they fail in compositional generalization, i.e., the ability to generalize to new structures built of components observed during training. In this work, we posit that a span-based parser should lead to better compositional generalization. we propose SpanBasedSP, a parser that predicts a span tree over an input utterance, explicitly encoding how partial programs compose over spans in the input. SpanBasedSP extends Pasupat et al. (2019) to be comparable to seq2seq models by (i) training from programs, without access to gold trees, treating trees as latent variables, (ii) parsing a class of non-projective trees through an extension to standard CKY. On GeoQuery, SCAN and CLOSURE datasets, SpanBasedSP performs similarly to strong seq2seq baselines on random splits, but dramatically improves performance compared to baselines on splits that require compositional generalization: from $61.0 \rightarrow 88.9$ average accuracy.

Jonathan Herzig, Jonathan Berant• 2020

Related benchmarks

Task	Dataset	Result
Semantic Parsing	GeoQuery (i.i.d.)	Exact Match Accuracy78.9	32
Semantic Parsing	GeoQuery compositional	Accuracy76.3	29
Semantic Parsing	GEO	Accuracy0.822	26
Semantic Parsing	COGS (generalization)	--	25
Text-to-SQL	Geoquery	Exact Match Accuracy82.2	17
Semantic Parsing	Geoquery (test)	Accuracy86.4	14
Semantic Parsing	CFQ (MCD1, MCD2, MCD3)	MCD1 Accuracy65.1	9

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord