GraphDF: A Discrete Flow Model for Molecular Graph Generation
About
We consider the problem of molecular graph generation using deep models. While graphs are discrete, most existing methods use continuous latent variables, resulting in inaccurate modeling of discrete graph structures. In this work, we propose GraphDF, a novel discrete latent variable model for molecular graph generation based on normalizing flow methods. GraphDF uses invertible modulo shift transforms to map discrete latent variables to graph nodes and edges. We show that the use of discrete latent variables reduces computational costs and eliminates the negative effect of dequantization. Comprehensive experimental results show that GraphDF outperforms prior methods on random generation, property optimization, and constrained optimization tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Molecular Generation | ZINC250K | Uniqueness99.97 | 68 | |
| Graph generation | ENZYMES | Clustering1.061 | 45 | |
| Molecular Generation | QM9 | Validity89 | 40 | |
| Molecular Graph Generation | QM9 | Validity93.88 | 37 | |
| Property optimization | ZINC250k (test) | 1st Order Metric0.844 | 33 | |
| Molecular Generation | QM9 (test) | Validity93.88 | 32 | |
| Molecule Generation | ZINC250K | Validity89.72 | 32 | |
| Constrained Property Optimization | ZINC250K | Improvement5.62 | 27 | |
| Abstract graph generation | Ego small | Average MMD0.06 | 27 | |
| Molecular Generation | ZINC250k (test) | Validity90.61 | 26 |