Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Rectified-CFG++ for Flow Based Models

About

Classifier-free guidance (CFG) is the workhorse for steering large diffusion models toward text-conditioned targets, yet its native application to rectified flow (RF) based models provokes severe off-manifold drift, yielding visual artifacts, text misalignment, and brittle behaviour. We present Rectified-CFG++, an adaptive predictor-corrector guidance that couples the deterministic efficiency of rectified flows with a geometry-aware conditioning rule. Each inference step first executes a conditional RF update that anchors the sample near the learned transport path, then applies a weighted conditional correction that interpolates between conditional and unconditional velocity fields. We prove that the resulting velocity field is marginally consistent and that its trajectories remain within a bounded tubular neighbourhood of the data manifold, ensuring stability across a wide range of guidance strengths. Extensive experiments on large-scale text-to-image models (Flux, Stable Diffusion 3/3.5, Lumina) show that Rectified-CFG++ consistently outperforms standard CFG on benchmark datasets such as MS-COCO, LAION-Aesthetic, and T2I-CompBench. Project page: https://rectified-cfgpp.github.io/

Shreshth Saini, Shashank Gupta, Alan C. Bovik• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationGenEval
Overall Score59.57
517
Text-to-Image GenerationPick-a-Pic
PickScore22.37
150
Text-to-Image GenerationDrawBench
Pick Score23.15
40
Text-to-Image GenerationText-to-Image Generation Dataset
FID32.4
30
Text-to-Image GenerationLAION 5B 1K
HPSv2.128.306
18
Text-to-Image GenerationMS COCO 1K
HPSv2.128.932
18
Shape completionMedShapeNet (test)
Dice Score56.93
16
Text to ImageMS-COCO 5k image-text pairs
FID20.55
15
Showing 8 of 8 rows

Other info

Follow for update