Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Efficient Synthetic Network Generation via Latent Embedding Reconstruction

About

Network data are ubiquitous across the social sciences, biology, and information systems. Generating realistic synthetic network data has broad applications from network simulation to scientific discovery. However, many existing black-box approaches for network generation tend to overfit observed data while overlooking characteristic network structure, and incur substantial computational overhead at scale. These practical challenges call for synthetic network generation methods that are both efficient and capable of capturing structural properties of networks. In this paper, we introduce Synthetic Network Generation via Latent Embedding Reconstruction (SyNGLER), a general and efficient framework for synthetic network generation that builds on latent space network models. Given an observed network, SyNGLER first learns low-dimensional latent node embeddings via a latent space network model and then reconstructs the latent space by building a distribution-free generator over these embeddings. For generation, SyNGLER first samples (or resamples) node embeddings from the generator in the latent space and then produces synthetic networks using the latent space network model. Through the latent space framework, SyNGLER preserves unique characteristics in networks such as sparsity and node degree heterogeneity, while allowing for efficient training with lower computational cost than many existing deep architectures. We provide theoretical guarantees by developing consistency results on the distance between the true and synthetic edge distributions. Empirical studies further demonstrate the effectiveness of SyNGLER, which efficiently produces networks that better preserve key network characteristics such as network moments and degree distributions compared with existing approaches. Code is available at https://github.com/FeifanJiang/syngler.

Feifan Jiang, Yinan Bu, Shihao Wu, Gongjun Xu, Ji Zhu• 2026

Related benchmarks

TaskDatasetResultRank
Synthetic Network GenerationPolBlogs
GFDL1 Score0.72
29
Synthetic Network GenerationDBLP
GFDL19.51
29
Synthetic Network GenerationYouTube
GFDL1 Score3.02
29
Synthetic Network GenerationYelp
GFDL1 Link Prediction Accuracy0.42
25
Synthetic Network GenerationYouTube
Eigenvalue MMD0.38
14
Synthetic Network GenerationPolBlogs
Triangle Similarity (MMD)0.59
11
Synthetic Network GenerationDBLP
Transitivity3.63
11
Synthetic Network GenerationDBLP
Link Prediction Accuracy100
10
Synthetic Network GenerationYouTube
Link Prediction Accuracy Ratio99
10
Synthetic Network GenerationYelp friendship network
Triangle Ratio (10^-4)2
9
Showing 10 of 23 rows

Other info

Follow for update