Graph Rationalization with Environment-based Augmentations
About
Rationale is defined as a subset of input features that best explains or supports the prediction by machine learning models. Rationale identification has improved the generalizability and interpretability of neural networks on vision and language data. In graph applications such as molecule and polymer property prediction, identifying representative subgraph structures named as graph rationales plays an essential role in the performance of graph neural networks. Existing graph pooling and/or distribution intervention methods suffer from lack of examples to learn to identify optimal graph rationales. In this work, we introduce a new augmentation operation called environment replacement that automatically creates virtual data examples to improve rationale identification. We propose an efficient framework that performs rationale-environment separation and representation learning on the real and augmented examples in latent spaces to avoid the high complexity of explicit graph decoding and encoding. Comparing against recent techniques, experiments on seven molecular and four polymer real datasets demonstrate the effectiveness and efficiency of the proposed augmentation-based graph rationalization framework.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Graph Classification | MolHIV | ROC AUC60.71 | 82 | |
| Graph Classification | Accuracy59.92 | 57 | ||
| Graph Classification | DrugOOD EC50 (OOD test) | ROC AUC71.15 | 52 | |
| Graph Classification | DrugOOD EC50 (Scaffold-based OOD shift) | ROC AUC63.79 | 36 | |
| Graph Classification | DrugOOD Ki-Sca (Scaffold-based OOD shift) | ROC-AUC67.82 | 36 | |
| Molecular property prediction | BACE | ROC-AUC82.37 | 35 | |
| Molecular property prediction | BBBP | ROC AUC0.6986 | 35 | |
| Molecular property prediction | ClinTox | ROC AUC89.61 | 34 | |
| Graph Classification | Molbbbp (scaffold) | ROC-AUC69.72 | 31 | |
| Graph Classification | Motif (size) | Accuracy54.13 | 29 |