
NAGphormer: A Tokenized Graph Transformer for Node Classification in Large Graphs

About

The graph Transformer has emerged as a new architecture and has shown superior performance on various graph mining tasks. In this work, we observe that existing graph Transformers treat nodes as independent tokens and construct a single long sequence composed of all node tokens to train the Transformer model, which makes them hard to scale to large graphs because the self-attention computation has quadratic complexity in the number of nodes. To this end, we propose a Neighborhood Aggregation Graph Transformer (NAGphormer) that treats each node as a sequence containing a series of tokens constructed by our proposed Hop2Token module. For each node, Hop2Token aggregates the neighborhood features from different hops into different representations and thereby produces a sequence of token vectors as one input. In this way, NAGphormer can be trained in a mini-batch manner and thus can scale to large graphs. Moreover, we mathematically show that, compared to a category of advanced Graph Neural Networks (GNNs), the decoupled Graph Convolutional Network, NAGphormer can learn more informative node representations from the multi-hop neighborhoods. Extensive experiments on benchmark datasets ranging from small to large demonstrate that NAGphormer consistently outperforms existing graph Transformers and mainstream GNNs. Code is available at https://github.com/JHL-HUST/NAGphormer.
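
The Hop2Token idea described above can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract does not specify the aggregation operator, so the symmetrically normalized adjacency matrix with self-loops used here is an assumption, and the names normalized_adjacency, matmul, and hop2token are hypothetical helpers introduced only for illustration. The sketch treats the k-hop token of a node as that node's row of the k-th propagated feature matrix, with hop 0 being the node's own features.

```python
import math


def normalized_adjacency(num_nodes, edges):
    """Build D^{-1/2} (A + I) D^{-1/2} for an undirected graph.

    Assumption: the abstract does not name the aggregation operator;
    this normalization is a common choice, used here for illustration.
    """
    adj = [[0.0] * num_nodes for _ in range(num_nodes)]
    for u, v in edges:
        adj[u][v] = 1.0
        adj[v][u] = 1.0
    for i in range(num_nodes):
        adj[i][i] = 1.0  # self-loop
    deg = [sum(row) for row in adj]
    return [
        [adj[i][j] / math.sqrt(deg[i] * deg[j]) for j in range(num_nodes)]
        for i in range(num_nodes)
    ]


def matmul(a, b):
    """Dense matrix product of two lists of lists."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def hop2token(num_nodes, edges, features, num_hops):
    """Return tokens[v][k]: the k-hop aggregated feature vector of node v."""
    a_hat = normalized_adjacency(num_nodes, edges)
    hop_feats = [features]  # hop 0: the raw node features
    for _ in range(num_hops):
        # One more propagation step aggregates one hop further out.
        hop_feats.append(matmul(a_hat, hop_feats[-1]))
    return [
        [hop_feats[k][v] for k in range(num_hops + 1)]
        for v in range(num_nodes)
    ]


# Toy path graph 0 - 1 - 2 with one-hot node features and K = 2 hops.
edges = [(0, 1), (1, 2)]
features = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
tokens = hop2token(3, edges, features, num_hops=2)
```

Each entry tokens[v] is a sequence of K + 1 token vectors that depends only on precomputed features, so sequences for different nodes can be batched independently, and self-attention runs over K + 1 tokens per node rather than over all nodes in the graph.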

Jinsong Chen, Kaiyuan Gao, Gaichao Li, Kun He • 2022

Related benchmarks

Task                 Dataset            Metric         Result  Rank
Node Classification  Cora               Accuracy       82.12   1215
Node Classification  Pubmed             Accuracy       89.7    819
Node Classification  ogbn-arxiv (test)  Accuracy       70.25   433
Node Classification  Pubmed             Accuracy       80.14   396
Node Classification  Citeseer           Accuracy       71.47   393
Node Classification  Photo              Mean Accuracy  95.49   343
Node Classification  wikiCS             Accuracy       77.92   317
Node Classification  Chameleon (test)   Mean Accuracy  46.12   297
Node Classification  Actor (test)       Mean Accuracy  0.3699  237
Node Classification  Roman-Empire       Accuracy       74.34   206
Showing 10 of 95 rows
