The quest for the GRAph Level autoEncoder (GRALE)
About
Although graph-based learning has attracted a lot of attention, graph representation learning is still a challenging task whose resolution may impact key application fields such as chemistry or biology. To this end, we introduce GRALE, a novel graph autoencoder that encodes and decodes graphs of varying sizes into a shared embedding space. GRALE is trained using an Optimal Transport-inspired loss that compares the original and reconstructed graphs and leverages a differentiable node matching module, which is trained jointly with the encoder and decoder. The proposed attention-based architecture relies on Evoformer, the core component of AlphaFold, which we extend to support both graph encoding and decoding. We show, in numerical experiments on simulated and molecular data, that GRALE enables a highly general form of pre-training, applicable to a wide range of downstream tasks, from classification and regression to more complex tasks such as graph interpolation, editing, matching, and prediction.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Graph Reconstruction | Coloring w/o ordering | Edit Distance0.11 | 4 | |
| Graph Reconstruction | PubChem16 | Edit Distance0.06 | 4 | |
| Graph Reconstruction | PubChem32 | Edit Distance0.7 | 4 | |
| Graph Reconstruction | QM9 NoHydro | Edit Distance0.05 | 3 | |
| Graph Reconstruction | Coloring with ordering | Edit Distance0.06 | 3 | |
| Regression | BN dataset | RMSE0.0247 | 3 | |
| Regression on molecular properties | PubChem 16 | Dipole Moment MSE0.4029 | 3 | |
| Regression on molecular properties | PubChem32 | Dipole Moment MSE0.3677 | 3 |