Hierarchical Poset Decoding for Compositional Generalization in Language
About
We formalize human language understanding as a structured prediction task where the output is a partially ordered set (poset). Current encoder-decoder architectures do not take the poset structure of semantics into account properly, thus suffering from poor compositional generalization ability. In this paper, we propose a novel hierarchical poset decoding paradigm for compositional generalization in language. Intuitively: (1) the proposed paradigm enforces partial permutation invariance in semantics, thus avoiding overfitting to bias ordering information; (2) the hierarchical mechanism allows to capture high-level structures of posets. We evaluate our proposed decoder on Compositional Freebase Questions (CFQ), a large and realistic natural language question answering dataset that is specifically designed to measure compositional generalization. Results show that it outperforms current decoders.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Semantic Parsing | CFQ MCD3 | Accuracy67.8 | 33 | |
| Semantic Parsing | CFQ (MCD2) | Accuracy59.6 | 33 | |
| Semantic Parsing | CFQ (MCD1) | Accuracy79.6 | 33 | |
| Semantic Parsing | CFQ MCD avg | Exact Match Accuracy69 | 22 | |
| Semantic Parsing | CFQ MCD3 (test) | Accuracy67.8 | 15 | |
| Semantic Parsing | CFQ MCD2 (test) | Accuracy0.596 | 15 | |
| Semantic Parsing | CFQ MCD1 (test) | Accuracy79.6 | 15 | |
| Semantic Parsing | CFQ (MCD1, MCD2, MCD3) | MCD1 Accuracy79.6 | 9 | |
| Semantic Parsing | CFQ Maximum Compound Divergence (MCD) | MCD172 | 4 |