struc2vec: Learning Node Representations from Structural Identity
About
Structural identity is a concept of symmetry in which network nodes are identified according to the network structure and their relationship to other nodes. Structural identity has been studied in theory and practice over the past decades, but only recently has it been addressed with representational learning techniques. This work presents struc2vec, a novel and flexible framework for learning latent representations for the structural identity of nodes. struc2vec uses a hierarchy to measure node similarity at different scales, and constructs a multilayer graph to encode structural similarities and generate structural context for nodes. Numerical experiments indicate that state-of-the-art techniques for learning node representations fail in capturing stronger notions of structural identity, while struc2vec exhibits much superior performance in this task, as it overcomes limitations of prior approaches. As a consequence, numerical experiments indicate that struc2vec improves performance on classification tasks that depend more on structural identity.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | Pubmed | -- | 742 | |
| Node Classification | Cora | Micro F10.3601 | 42 | |
| Node Classification | DD68 | Micro F113.71 | 32 | |
| Node Classification | dd687 | Micro F110.3 | 32 | |
| Node Classification | Wisconsin | Micro F17.98 | 32 | |
| Node Classification | Cornell | Micro F145.39 | 32 | |
| Node Classification | Brazil | Micro F138.39 | 32 | |
| Node Classification | DD242 | Micro F125.54 | 32 | |
| Gene-Disease Association Prediction | DisGeNET curated GDAs benchmark Graph 5 (test) | ROC-AUC0.909 | 14 | |
| Page Classification | Wikipedia (90% train ratio) | Macro-F1 Score19 | 13 |