Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Geometric Representation Condition Improves Equivariant Molecule Generation

About

Recent advances in molecular generative models have demonstrated great promise for accelerating scientific discovery, particularly in drug design. However, these models often struggle to generate high-quality molecules, especially in conditional scenarios where specific molecular properties must be satisfied. In this work, we introduce GeoRCG, a general framework to improve molecular generative models by integrating geometric representation conditions with provable theoretical guarantees. We decompose the generation process into two stages: first, generating an informative geometric representation; second, generating a molecule conditioned on the representation. Compared with single-stage generation, the easy-to-generate representation in the first stage guides the second stage generation toward a high-quality molecule in a goal-oriented way. Leveraging EDM and SemlaFlow as base generators, we observe significant quality improvements in unconditional molecule generation on the widely used QM9 and GEOM-DRUG datasets. More notably, in the challenging conditional molecular generation task, our framework achieves an average 50\% performance improvement over state-of-the-art approaches, highlighting the superiority of conditioning on semantically rich geometric representations. Furthermore, with such representation guidance, the number of diffusion steps can be reduced to as small as 100 while largely preserving the generation quality achieved with 1,000 steps, thereby significantly reducing the generation iterations needed. Code is available at https://github.com/GraphPKU/GeoRCG.

Zian Li, Cai Zhou, Xiyuan Wang, Xingang Peng, Muhan Zhang• 2024

Related benchmarks

TaskDatasetResultRank
Unconditional Molecule GenerationQM9 (test)
Validity95.05
54
Unconditional Molecule GenerationGEOM-DRUG (test)
Atom Stability (%)81.44
16
3D Molecular GenerationGEOM-DRUG
Atom Stability99.8
7
Showing 3 of 3 rows

Other info

Follow for update