B-cos GNNs: Faithful Explanations through Dynamic Linearity

About

We introduce B-cos GNNs, an inherently explainable class of graph neural networks whose predictions decompose exactly into per-node, per-feature contributions via a single input-dependent linear map. B-cos GNNs use linear (sum-based) aggregation and replace non-linear message and update functions with B-cos transforms. This induces meaningful, task-specific weight-input alignment that is directly accessible through the model's dynamic linearity. Instance-level explanations follow from a single forward and backward pass, requiring no auxiliary explainer, modified learning objective, or perturbation procedure. Instantiated as a GIN, our approach trades small losses in predictive accuracy for state-of-the-art explainability across diverse synthetic and real-world benchmarks, producing explanations orders of magnitude faster than post-hoc baselines.

Joschka Gro{\ss}, Mohammad Shaique Solanki, Verena Wolf• 2026

Related benchmarks

Task	Dataset	Result
Graph Explanation	BA-2Motif (test)	Jaccard Index84	4
Graph Explanation	Di-Halo-Benzene (test)	Jaccard Index0.96	4
Graph Explanation	MNIST-75SP (test)	Jaccard Index91	4

Showing 3 of 3 rows

Other info

Follow for update

@wizwand_team Discord