Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

About

We present evaluation results for FLUX.1 Kontext, a generative flow matching model that unifies image generation and editing. The model generates novel output views by incorporating semantic context from text and image inputs. Using a simple sequence concatenation approach, FLUX.1 Kontext handles both local editing and generative in-context tasks within a single unified architecture. Compared to current editing models that exhibit degradation in character consistency and stability across multiple turns, we observe that FLUX.1 Kontext improved preservation of objects and characters, leading to greater robustness in iterative workflows. The model achieves competitive performance with current state-of-the-art systems while delivering significantly faster generation times, enabling interactive applications and rapid prototyping workflows. To validate these improvements, we introduce KontextBench, a comprehensive benchmark with 1026 image-prompt pairs covering five task categories: local editing, global editing, character reference, style reference and text editing. Detailed evaluations show the superior performance of FLUX.1 Kontext in terms of both single-turn quality and multi-turn consistency, setting new standards for unified image processing models.

Black Forest Labs, Stephen Batifol, Andreas Blattmann, Frederic Boesel, Saksham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, Kyle Lacey, Yam Levi, Cheng Li, Dominik Lorenz, Jonas M\"uller, Dustin Podell, Robin Rombach, Harry Saini, Axel Sauer, Luke Smith• 2025

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationGenEval
Overall Score66
506
Text-to-Image GenerationGenEval
Overall Score82
391
Text-to-Image GenerationGenEval
GenEval Score82
360
Image GenerationImageNet 256x256
IS348.9
359
Text-to-Image GenerationGenEval
Overall Score66
218
Image EditingImgEdit-Bench
Overall Score4
191
Text-to-Image GenerationT2I-CompBench
Shape Fidelity51.12
185
Image EditingPIE-Bench
PSNR34.91
166
Image ReconstructionCOCO 2017 (val)
PSNR30.89
123
Image ReconstructionImageNet (val)
rFID0.176
95
Showing 10 of 312 rows
...

Other info

Follow for update