Provably Communication-Efficient and Privacy-Preserving Federated Graph Neural Networks
About
Graph neural networks (GNNs) achieve strong performance on relational data, but real-world graphs are often distributed across organizations that cannot share raw data due to privacy and policy constraints. Existing federated GNN methods either ignore cross-client links, leading to degraded accuracy, or require frequent embedding exchanges, incurring substantial communication and privacy costs. We propose CE-FedGNN, a communication-efficient and privacy-preserving federated GNN framework for learning over such coupled graphs. Our approach avoids sharing raw data or per-round embeddings by infrequently exchanging aggregated node representations. To handle cross-client dependency and staleness, we introduce a moving-average estimator that continuously tracks node representations and enables their stable reuse across rounds. To provide formal privacy guarantees for the released representations, we adopt the metric differential privacy (metric-DP) framework, which measures privacy with respect to distances in the learned embedding space rather than worst-case input perturbations. This yields meaningful guarantees at noise levels where standard differential privacy becomes overly conservative. We establish convergence to a stationary point at a rate of $O(1/\sqrt{T})$ with $O(T^{3/4})$ communication complexity. In addition, we derive $(\varepsilon,\delta)$-metric-DP guarantees via R\'enyi differential privacy composition under a public-cohort threat model. Experiments on synthetic interbank anti-money laundering benchmarks and citation networks demonstrate that CE-FedGNN achieves strong performance while significantly reducing communication and maintaining robustness under privacy-preserving noise.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Node Classification | Cora | Macro-F147.01 | 30 | |
| Node Classification | Citeseer | F1 Score43.43 | 27 | |
| AML Node Classification | Synthetic AML HI-Small | Average F1 Score66.23 | 12 | |
| AML Node Classification | Synthetic AML HI-Medium | Average F165.17 | 12 | |
| AML Node Classification | Synthetic AML HI-Large | Average F1 Score71.14 | 12 | |
| AML Node Classification | Synthetic AML LI-Small | Average F1 Score26.55 | 12 | |
| AML Node Classification | Synthetic AML LI-Medium | Avg F1 Score0.2918 | 12 | |
| AML Node Classification | Synthetic AML LI-Large | Average F1 Score31.58 | 12 | |
| Node Classification | Pubmed | Average F165.17 | 8 | |
| Node Classification | MSAcademic | Average F184.97 | 8 |