Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Social-JEPA: Emergent Geometric Isomorphism

About

World models compress rich sensory streams into compact latent codes that anticipate future observations. We let separate agents acquire such models from distinct viewpoints of the same environment without any parameter sharing or coordination. After training, their internal representations exhibit a striking emergent property: the two latent spaces are related by an approximate linear isometry, enabling transparent translation between them. This geometric consensus survives large viewpoint shifts and scant overlap in raw pixels. Leveraging the learned alignment, a classifier trained on one agent can be ported to the other with no additional gradient steps, while distillation-like migration accelerates later learning and markedly reduces total compute. The findings reveal that predictive learning objectives impose strong regularities on representation geometry, suggesting a lightweight path to interoperability among decentralized vision systems. The code is available at https://anonymous.4open.science/r/Social-JEPA-5C57.

Haoran Zhang, Youjin Wang, Yi Duan, Rong Fu, Dianyu Zhao, Sicheng Fan, Shuaishuai Cao, Wentao Guo, Xiao Zhou• 2026

Related benchmarks

TaskDatasetResultRank
Representation AlignmentImageNet-1K
MSE0.091
9
Latent IsomorphismImageNet-1K
MSE0.091
6
Latent IsomorphismSmallnorb
MSE0.036
6
Showing 3 of 3 rows

Other info

Follow for update