Clustering as Reasoning: A $k$-Means Interpretation of Chain-of-Thought Graph Learning

About

Chain-of-Thought (CoT) prompting has shown promise in enhancing the reasoning capabilities of large language models (LLMs) on text-attributed graphs (TAGs). This work reframes CoT-based graph learning through the principle of clustering as reasoning, offering a $k$-means interpretation of how iterative reasoning operates over graph-structured data. We observe that existing graph CoT methods rely on disjoint architectures and fixed graph representations, limiting step-by-step semantic-topological interaction and interpretability. To overcome this limitation, we propose a unified framework named KCoT that integrates CoT reasoning with graph representation learning. Our key theoretical result reveals a formal mathematical correspondence between a Transformer block and the $k$-means algorithm, allowing reasoning to be interpreted as iterative assignment and update steps. Based on this insight, we introduce a Semantic Discriminating Prompt that explicitly formulates these steps as structured CoT reasoning, together with a structure-grounded alignment strategy to fuse topological priors with evolving thought-conditioned representations. Experiments on standard benchmarks demonstrate consistent improvements over state-of-the-art methods, validating clustering as a principled mechanism for CoT-based graph learning.

Xuanting Xie, Zhaochen Guo, Bingheng Li, Xingtong Yu, Zhifei Liao, Zhao Kang, Yuan Fang• 2026

Related benchmarks

Task	Dataset	Result
Node Classification	Cora	Accuracy91.64	609
Node Classification	Pubmed	Accuracy96.79	501
Node Classification	arXiv	Accuracy79.26	325
Node Classification	Products	Accuracy86.8	94
Link Prediction	Pubmed	Accuracy95.97	55
Link Prediction	arXiv	--	40
Link Prediction	Products	Accuracy97.3	29

Showing 7 of 7 rows

Other info

Follow for update

@wizwand_team Discord