Graph Diffusion Transformers for Multi-Conditional Molecular Generation
About
Inverse molecular design with diffusion models holds great potential for advancements in material and drug discovery. Despite success in unconditional molecular generation, integrating multiple properties such as synthetic score and gas permeability as condition constraints into diffusion models remains unexplored. We present the Graph Diffusion Transformer (Graph DiT) for multi-conditional molecular generation. Graph DiT integrates an encoder to learn numerical and categorical property representations with the Transformer-based denoiser. Unlike previous graph diffusion models that add noise separately on the atoms and bonds in the forward diffusion process, Graph DiT is trained with a novel graph-dependent noise model for accurate estimation of graph-related noise in molecules. We extensively validate Graph DiT for multi-conditional polymer and small molecule generation. Results demonstrate the superiority of Graph DiT across nine metrics from distribution learning to condition control for molecular properties. A polymer inverse design task for gas separation with feedback from domain experts further demonstrates its practical utility.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Controllable Molecular Generation | Molecular and Polymer properties 9 properties aggregation (test) | Average Rank2 | 27 | |
| Molecule Generation | ChEBI-20 (test) | Exact Match0.00e+0 | 26 | |
| Conditional 2D Molecular Graph Generation | Synth. & HIV | Diversity89.74 | 14 | |
| Heterogeneous Conditional Molecular Generation | 10K Molecules Drug-related task set | Validity83.44 | 14 | |
| Conditional 2D Molecular Graph Generation | Synth. & BBBP | Diversity88.56 | 14 | |
| Heterogeneous Conditional Molecular Generation | 10K Polymers | Validity68.68 | 14 | |
| Conditional 2D Molecular Graph Generation | Synth. & BACE | Diversity82.38 | 14 | |
| Conditional molecular generation | 10K Polymers (test) | Validity81.76 | 14 | |
| Molecule Generation | Polymer and Drug datasets (test) | Novelty91.39 | 14 | |
| Controllable Molecular Generation | DFT unseen targets: Ei, EPS (test) | Validity60.76 | 5 |