Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

About

Semantic segmentation and semantic image synthesis are two representative tasks in visual perception and generation. While existing methods consider them as two distinct tasks, we propose a unified framework (SemFlow) and model them as a pair of reverse problems. Specifically, motivated by rectified flow theory, we train an ordinary differential equation (ODE) model to transport between the distributions of real images and semantic masks. As the training object is symmetric, samples belonging to the two distributions, images and semantic masks, can be effortlessly transferred reversibly. For semantic segmentation, our approach solves the contradiction between the randomness of diffusion outputs and the uniqueness of segmentation results. For image synthesis, we propose a finite perturbation approach to enhance the diversity of generated results without changing the semantic categories. Experiments show that our SemFlow achieves competitive results on semantic segmentation and semantic image synthesis tasks. We hope this simple framework will motivate people to rethink the unification of low-level and high-level vision.

Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang• 2024

Related benchmarks

TaskDatasetResultRank
Semantic segmentationCOCO Stuff
mIoU38.6
379
Semantic Image SynthesisCOCO Stuff
FID90
49
Semantic Image SynthesisCelebAMask-HQ
FID32.6
33
Semantic segmentationCelebAMask-HQ
mIoU69.4
7
SegmentationCRACK500
mIoU17.2
7
SegmentationCrackTree260
mIoU4.5
7
SegmentationCrackLS315
mIoU4.1
7
SegmentationDRIVE
mIoU29.5
7
SegmentationXCAD
mIoU15.7
7
Showing 9 of 9 rows

Other info

Code

Follow for update