Hierarchical Sketch Induction for Paraphrase Generation
About
We propose a generative model of paraphrase generation, that encourages syntactic diversity by conditioning on an explicit syntactic sketch. We introduce Hierarchical Refinement Quantized Variational Autoencoders (HRQ-VAE), a method for learning decompositions of dense encodings as a sequence of discrete latent variables that make iterative refinements of increasing granularity. This hierarchy of codes is learned through end-to-end training, and represents fine-to-coarse grained information about the input. We use HRQ-VAE to encode the syntactic form of an input sentence as a path through the hierarchy, allowing us to more easily predict syntactic sketches at test time. Extensive experiments, including a human evaluation, confirm that HRQ-VAE learns a hierarchical representation of the input space, and generates paraphrases of higher quality than previous systems.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Paraphrase Generation | QQP (test) | -- | 22 | |
| Paraphrase Generation | MSCOCO (test) | Self-BLEU16.58 | 14 | |
| Paraphrase Generation | Paralex (test) | BLEU39.49 | 11 | |
| Paraphrase Generation | Paralex | iBLEU22.75 | 4 | |
| Paraphrase Generation | QQP | iBLEU17.49 | 4 | |
| Paraphrase Generation | MSCOCO | iBLEU18.39 | 4 |