SLASH the Sink: Sharpening Structural Attention Inside LLMs

About

Large Language Models (LLMs) show remarkable semantic understanding but often struggle with structural understanding when processing graph topologies in a serialized format. Existing solutions rely on training external graph-based adapters or fine-tuning, which incur high costs and lost generalizability. In this work, we investigate the internal mechanisms of LLMs and present a critical finding: LLMs spontaneously reconstruct the graph's topology internally, evidenced by a distinct "sawtooth" pattern in their attention maps that structurally aligns with the "token-level adjacency matrix". However, this intrinsic structural understanding is diluted by the attention sink. We theoretically formalize this dilution as a representation bottleneck, stemming from a fundamental conflict: the model's anisotropic bias, essential for language tasks, suppresses the topology-aware local aggregation required for graph reasoning. To address this, we propose a training-free solution, named StructuraL Attention SHarpening (SLASH), which amplifies this internal structural understanding via a plug-and-play attention redistribution. Experiments on pure graph tasks and molecular prediction validate that SLASH delivers significant and consistent performance gains across diverse LLMs.

Yiming Liu, Bin Lu, Xinbing Wang, Chenghu Zhou, Meng Jin• 2026

Related benchmarks

Task	Dataset	Result
Graph Reasoning	GraphInstruct (test)	Cycle Accuracy93.1	16
Molecular property prediction	MolecularNet	BACE Score0.65	12
Graph QA	PathQuestion InstructGraph (test)	Accuracy38.6	4
Link Prediction	Wikidata5M InstructGraph (test)	Accuracy26	4
Node Classification	Cora InstructGraph (test)	Accuracy91.4	4

Showing 5 of 5 rows

Other info

Follow for update

@wizwand_team Discord