GoMS: Graph of Molecule Substructure Network for Molecule Property Prediction

About

While graph neural networks have shown remarkable success in molecular property prediction, current approaches like the Equivariant Subgraph Aggregation Networks (ESAN) treat molecules as bags of independent substructures, overlooking crucial relationships between these components. We present Graph of Molecule Substructures (GoMS), a novel architecture that explicitly models the interactions and spatial arrangements between molecular substructures. Unlike ESAN's bag-based representation, GoMS constructs a graph where nodes represent subgraphs and edges capture their structural relationships, preserving critical topological information about how substructures are connected and overlap within the molecule. Through extensive experiments on public molecular datasets, we demonstrate that GoMS outperforms ESAN and other baseline methods, with particularly large improvements for molecules containing more than 100 atoms. The performance gap widens as molecular size increases, demonstrating GoMS's effectiveness for modeling industrial-scale molecules. Our theoretical analysis shows that GoMS can distinguish molecules with identical subgraph compositions but different spatial arrangements. Our approach shows particular promise for materials science applications involving complex molecules where properties emerge from the interplay between multiple functional units. By capturing substructure relationships that are lost in bag-based approaches, GoMS represents a significant advance toward scalable and interpretable molecular property prediction for real-world applications.
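The idea of a graph whose nodes are substructures and whose edges record how those substructures overlap or connect can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `build_substructure_graph`, the edge labels `"overlap"` and `"bond"`, and the toy fragment decomposition are all assumptions made for illustration, and the abstract does not specify how GoMS extracts substructures or featurizes edges.

```python
from itertools import combinations


def build_substructure_graph(bonds, substructures):
    """Link substructures that share atoms or are joined by a bond.

    bonds: iterable of (atom_i, atom_j) pairs describing the molecule.
    substructures: list of sets of atom indices (hypothetical decomposition).
    Returns a dict mapping (i, j) substructure index pairs to
    (relation, count), where relation is "overlap" or "bond".
    """
    bond_set = {frozenset(b) for b in bonds}
    edges = {}
    for (i, a), (j, b) in combinations(enumerate(substructures), 2):
        shared = a & b
        if shared:
            # The two substructures have atoms in common.
            edges[(i, j)] = ("overlap", len(shared))
            continue
        # Otherwise count bonds that cross from one substructure to the other.
        crossing = sum(1 for u in a for v in b if frozenset((u, v)) in bond_set)
        if crossing:
            edges[(i, j)] = ("bond", crossing)
    return edges


# Toy molecule: a 6-atom chain 0-1-2-3-4-5 split into three fragments.
bonds = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]
substructures = [{0, 1, 2}, {2, 3}, {4, 5}]
edges = build_substructure_graph(bonds, substructures)
print(edges)
```

A bag-based representation would keep only the three fragments. The edge dictionary additionally records that fragments 0 and 1 overlap, that fragments 1 and 2 are bonded, and that fragments 0 and 2 are not directly related, which is the kind of arrangement information the abstract says is lost when substructures are treated independently.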

Shuhui Qu, Cheolwoo Park • 2025

Related benchmarks

Task                           Dataset                       Result        Rank
Molecular property prediction  QM9                           Cv 0.026      70
Molecular property prediction  PCQM4M V2                     MAE 0.078     10
Molecular property prediction  Molecule3D (random)           MAE 0.0301    9
Molecular property prediction  Molecule3D (scaffold)         MAE 0.1174    9
Molecular property prediction  OLED 100–500 atoms (random)   S1 MAE 0.25   7
