SLASH the Sink: Sharpening Structural Attention Inside LLMs
About
Large Language Models (LLMs) show remarkable semantic understanding but often struggle with structural understanding when processing graph topologies in a serialized format. Existing solutions rely on training external graph-based adapters or fine-tuning, which incur high costs and lost generalizability. In this work, we investigate the internal mechanisms of LLMs and present a critical finding: LLMs spontaneously reconstruct the graph's topology internally, evidenced by a distinct "sawtooth" pattern in their attention maps that structurally aligns with the "token-level adjacency matrix". However, this intrinsic structural understanding is diluted by the attention sink. We theoretically formalize this dilution as a representation bottleneck, stemming from a fundamental conflict: the model's anisotropic bias, essential for language tasks, suppresses the topology-aware local aggregation required for graph reasoning. To address this, we propose a training-free solution, named StructuraL Attention SHarpening (SLASH), which amplifies this internal structural understanding via a plug-and-play attention redistribution. Experiments on pure graph tasks and molecular prediction validate that SLASH delivers significant and consistent performance gains across diverse LLMs.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Graph Reasoning | GraphInstruct (test) | Cycle Accuracy93.1 | 16 | |
| Molecular property prediction | MolecularNet | BACE Score0.65 | 12 | |
| Graph QA | PathQuestion InstructGraph (test) | Accuracy38.6 | 4 | |
| Link Prediction | Wikidata5M InstructGraph (test) | Accuracy26 | 4 | |
| Node Classification | Cora InstructGraph (test) | Accuracy91.4 | 4 |