Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Geometry-Aware Image Flow Matching

About

Recent advances in generative models highlight the power of geometry-aware modeling in manifold-constrained settings. Yet, for natural images, the field remains confined to Euclidean assumptions, failing to exploit the potential of intrinsic geometric structures within the data. In this work, we investigate the geometry of natural images and observe that semantic information is predominantly encoded in directional components, while norm components can be approximated by the global average. This property holds across both RGB and latent spaces, suggesting that natural images can be effectively modeled on a hypersphere. Building on this finding, we introduce Spherical Optimal Transport Flow Matching (SOT-CFM), which utilizes angular distance, and Spherical Flow Matching (SFM), which constrains dynamics directly on the manifold. Our experiments demonstrate that these geometry-aware methods achieve superior performance against Euclidean baselines. Ultimately, this work provides a novel perspective that bridges the gap between Riemannian manifold-based modeling and natural image generation.

Junho Lee, Kwanseok Kim, Joonseok Lee• 2026

Related benchmarks

TaskDatasetResultRank
Class-conditional Image GenerationImageNet class-conditional 256x256
Inception Score (IS)337.9
61
Image ClassificationImageNet 256x256 (val)--
16
Image GenerationCIFAR-10
gFID3.79
6
Image ClassificationImageNet 50,000 generated samples 256x256 (test)
Top-1 Accuracy87.13
3
Showing 4 of 4 rows

Other info

Follow for update