Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Laplacian Multi-scale Flow Matching for Generative Modeling

About

In this paper, we present Laplacian multiscale flow matching (LapFlow), a novel framework that enhances flow matching by leveraging multi-scale representations for image generative modeling. Our approach decomposes images into Laplacian pyramid residuals and processes different scales in parallel through a mixture-of-transformers (MoT) architecture with causal attention mechanisms. Unlike previous cascaded approaches that require explicit renoising between scales, our model generates multi-scale representations in parallel, eliminating the need for bridging processes. The proposed multi-scale architecture not only improves generation quality but also accelerates the sampling process and promotes scaling flow matching methods. Through extensive experimentation on CelebA-HQ and ImageNet, we demonstrate that our method achieves superior sample quality with fewer GFLOPs and faster inference compared to single-scale and multi-scale flow matching baselines. The proposed model scales effectively to high-resolution generation (up to 1024$\times$1024) while maintaining lower computational overhead.

Zelin Zhao, Petr Molodyk, Haotian Xue, Yongxin Chen• 2026

Related benchmarks

TaskDatasetResultRank
Class-conditional Image GenerationImageNet 256x256--
441
Unconditional Image GenerationCelebA-HQ 256x256
Fréchet Distance (FD)3.53
27
Unconditional Image GenerationCelebA-HQ 512 x 512
FID4.04
2
Unconditional Image GenerationCelebA-HQ 1024 x 1024
FID5.51
2
Showing 4 of 4 rows

Other info

Follow for update