GraphAF: a Flow-based Autoregressive Model for Molecular Graph Generation
About
Molecular graph generation is a fundamental problem for drug discovery and has been attracting growing attention. The problem is challenging since it requires not only generating chemically valid molecular structures but also optimizing their chemical properties in the meantime. Inspired by the recent progress in deep generative models, in this paper we propose a flow-based autoregressive model for graph generation called GraphAF. GraphAF combines the advantages of both autoregressive and flow-based approaches and enjoys: (1) high model flexibility for data density estimation; (2) efficient parallel computation for training; (3) an iterative sampling process, which allows leveraging chemical domain knowledge for valency checking. Experimental results show that GraphAF is able to generate 68% chemically valid molecules even without chemical knowledge rules and 100% valid molecules with chemical rules. The training process of GraphAF is two times faster than the existing state-of-the-art approach GCPN. After fine-tuning the model for goal-directed property optimization with reinforcement learning, GraphAF achieves state-of-the-art performance on both chemical property optimization and constrained property optimization.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Property optimization | ZINC250k (test) | 1st Order Metric0.948 | 33 | |
| Constrained Property Optimization | ZINC250K | Improvement4.99 | 27 | |
| Abstract graph generation | Community small | Degree0.06 | 17 | |
| Abstract graph generation | Ego small | Average MMD0.0293 | 17 | |
| structure-based drug design | protein targets (Set B) | Uniqueness97 | 14 | |
| Graph generation | Ego-small (test) | Degree0.03 | 11 | |
| Distribution-learning | QM9 | Uniqueness50 | 10 | |
| Distribution-learning | ZINC250K | Uniqueness28.8 | 10 | |
| Molecular Graph Generation | QM9 | Validity67 | 10 | |
| Molecular Generation | ZINC 250K | Validity68.47 | 9 |