NAGphormer: A Tokenized Graph Transformer for Node Classification in Large Graphs

About

The graph Transformer has emerged as a new architecture and has shown superior performance on various graph mining tasks. In this work, we observe that existing graph Transformers treat nodes as independent tokens and construct a single long sequence composed of all node tokens to train the Transformer model, which makes it hard to scale to large graphs because self-attention has quadratic complexity in the number of nodes. To this end, we propose a Neighborhood Aggregation Graph Transformer (NAGphormer) that treats each node as a sequence containing a series of tokens constructed by our proposed Hop2Token module. For each node, Hop2Token aggregates the neighborhood features from different hops into different representations and thereby produces a sequence of token vectors as one input. In this way, NAGphormer can be trained in a mini-batch manner and thus can scale to large graphs. Moreover, we mathematically show that, compared to a category of advanced Graph Neural Networks (GNNs), the decoupled Graph Convolutional Network, NAGphormer can learn more informative node representations from the multi-hop neighborhoods. Extensive experiments on benchmark datasets ranging from small to large demonstrate that NAGphormer consistently outperforms existing graph Transformers and mainstream GNNs. Code is available at https://github.com/JHL-HUST/NAGphormer.
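
The Hop2Token idea described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation (see the repository linked above for that). The function name `hop2token`, the symmetric normalization with self-loops, and the toy path graph are all assumptions made for illustration; the abstract only states that features from different hops are aggregated into a sequence of token vectors per node.

```python
import numpy as np


def hop2token(adj, features, num_hops):
    """Build a per-node token sequence from multi-hop aggregated features.

    Returns an array of shape (num_nodes, num_hops + 1, feat_dim), where
    token k of a node holds features propagated k times over the graph.
    """
    n = adj.shape[0]
    # Assumption: add self-loops and normalize symmetrically,
    # A_hat = D^{-1/2} (A + I) D^{-1/2}. The abstract does not fix this choice.
    a_tilde = adj + np.eye(n)
    deg = a_tilde.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    a_hat = a_tilde * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

    tokens = [features]  # hop 0: each node's own features
    current = features
    for _ in range(num_hops):
        current = a_hat @ current  # propagate one more hop
        tokens.append(current)
    return np.stack(tokens, axis=1)


# Toy path graph 0 - 1 - 2 - 3 with one-hot node features (hypothetical data).
adj = np.array(
    [[0, 1, 0, 0],
     [1, 0, 1, 0],
     [0, 1, 0, 1],
     [0, 0, 1, 0]],
    dtype=float,
)
features = np.eye(4)
tokens = hop2token(adj, features, num_hops=2)

# A mini-batch is just a slice over nodes; its size does not depend on the graph size.
batch = tokens[[0, 2]]
print(tokens.shape, batch.shape)
```

Because the token sequences are precomputed once, each node's sequence has length `num_hops + 1` regardless of how many nodes the graph contains, so attention within a sequence is cheap and nodes can be sampled into mini-batches independently.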

Jinsong Chen, Kaiyuan Gao, Gaichao Li, Kun He • 2022

Related benchmarks

Task	Dataset	Metric	Result
Node Classification	Cora	Accuracy	82.12	1215
Node Classification	Pubmed	Accuracy	89.7	865
Node Classification	Pubmed	Accuracy	80.14	627
Node Classification	Cora	Accuracy	80.92	583
Node Classification	Citeseer	Accuracy	71.47	503
Node Classification	ogbn-arxiv (test)	Accuracy	70.25	497
Node Classification	Photo	Mean Accuracy	95.49	374
Node Classification	Pubmed	Accuracy	89.7	363
Node Classification	Chameleon (test)	Mean Accuracy	46.12	335
Node Classification	wikiCS	Accuracy	77.92	329

Showing 10 of 107 rows
