Graph Diffusion Policy Optimization

About

Recent research has made significant progress in optimizing diffusion models for downstream objectives, which is an important pursuit in fields such as graph generation for drug design. However, directly applying these methods to graphs presents challenges and results in suboptimal performance. This paper introduces graph diffusion policy optimization (GDPO), a novel approach to optimizing graph diffusion models for arbitrary (e.g., non-differentiable) objectives using reinforcement learning. GDPO is based on an eager policy gradient tailored for graph diffusion models, which was developed through careful analysis and promises improved performance. Experimental results show that GDPO achieves state-of-the-art performance in various graph generation tasks with complex and diverse objectives. Code is available at https://github.com/sail-sg/GDPO.
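To illustrate what optimizing a sampler for a non-differentiable objective with reinforcement learning can look like, here is a minimal sketch of a plain score-function (REINFORCE) policy gradient on a toy categorical policy. It is not GDPO's eager policy gradient and involves no graph diffusion model. The reward function, the target action, and all hyperparameters are hypothetical and chosen only for illustration.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reward(action):
    # Hypothetical black-box objective: it is only evaluated on samples
    # and provides no gradient with respect to the policy parameters.
    return 1.0 if action == 2 else 0.0


def reinforce(num_actions=4, steps=2000, batch=16, lr=0.5, seed=0):
    """Plain REINFORCE with a batch-mean baseline on a categorical policy."""
    rng = random.Random(seed)
    logits = [0.0] * num_actions  # policy parameters, uniform at start
    for _ in range(steps):
        probs = softmax(logits)
        actions = rng.choices(range(num_actions), weights=probs, k=batch)
        rewards = [reward(a) for a in actions]
        baseline = sum(rewards) / batch  # reduces variance of the estimate
        grad = [0.0] * num_actions
        for a, r in zip(actions, rewards):
            advantage = r - baseline
            for k in range(num_actions):
                # d/d logit_k of log softmax(logits)[a] = 1[k == a] - probs[k]
                indicator = 1.0 if k == a else 0.0
                grad[k] += advantage * (indicator - probs[k]) / batch
        # Gradient ascent on the expected reward.
        logits = [l + lr * g for l, g in zip(logits, grad)]
    return softmax(logits)


final_probs = reinforce()
```

After training, `final_probs` should place most of its mass on the rewarded action, even though the reward was only ever queried on sampled actions. In a diffusion setting, the single sampling step above would be replaced by a multi-step denoising trajectory, which is where the paper's tailored estimator comes in.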

Yijing Liu, Chao Du, Tianyu Pang, Chongxuan Li, Min Lin, Wei Chen • 2024

Related benchmarks

Task	Dataset	Result
Controllable Molecular Generation	Molecular and Polymer properties 9 properties aggregation (test)	Average Rank: 2	27
Graph generation	Planar Graphs (test)	Unique Node %: 73.83	25
Graph generation	SBM Graphs (test)	Degree: 0.15	25
Molecule Generation	Polymer and Drug datasets (test)	Novelty: 100	14
Heterogeneous Conditional Molecular Generation	10K Molecules Drug-related task set	Validity: 73.07	14
Heterogeneous Conditional Molecular Generation	10K Polymers	Validity: 65.04	14
Conditional molecular generation	10K Polymers (test)	Validity: 25.78	14
Protein Docking	ZINC250k target: braf (test)	Docking Score (top 5%): -11.197	9
Protein Docking	ZINC250k target: parp1 (test)	Docking Score (top 5%): -10.938	9
Protein Docking	ZINC250k target: fa7 (test)	Docking Score (top 5%): -8.691	9

Showing 10 of 18 rows
