Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GraphTARIF: Linear Graph Transformer with Augmented Rank and Improved Focus

About

Linear attention mechanisms have emerged as efficient alternatives to full self-attention in Graph Transformers, offering linear time complexity. However, existing linear attention models often suffer from a significant drop in expressiveness due to low-rank projection structures and overly uniform attention distributions. We theoretically prove that these properties reduce the class separability of node representations, limiting the model's classification ability. To address this, we propose a novel hybrid framework that enhances both the rank and focus of attention. Specifically, we enhance linear attention by attaching a gated local graph network branch to the value matrix, thereby increasing the rank of the resulting attention map. Furthermore, to alleviate the excessive smoothing effect inherent in linear attention, we introduce a learnable log-power function into the attention scores to reduce entropy and sharpen focus. We theoretically show that this function decreases entropy in the attention distribution, enhancing the separability of learned embeddings. Extensive experiments on both homophilic and heterophilic graph benchmarks demonstrate that our method achieves competitive performance while preserving the scalability of linear attention.

Zhaolin Hu, Kun Li, Hehe Fan, Yi Yang• 2025

Related benchmarks

TaskDatasetResultRank
Node ClassificationSquirrel
Accuracy45.58
786
Node ClassificationRoman-Empire
Accuracy93.23
327
Node Classificationamazon-ratings
Accuracy55.86
309
Node ClassificationCS
Accuracy96.51
175
Node ClassificationComputer
Accuracy94.61
159
Node ClassificationPhoto
Accuracy97.03
153
Node ClassificationMinesweeper
Accuracy99.03
113
Node ClassificationPhysics
Accuracy97.39
79
Showing 8 of 8 rows

Other info

Follow for update