FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

About

We present evaluation results for FLUX.1 Kontext, a generative flow matching model that unifies image generation and editing. The model generates novel output views by incorporating semantic context from text and image inputs. Using a simple sequence concatenation approach, FLUX.1 Kontext handles both local editing and generative in-context tasks within a single unified architecture. Current editing models exhibit degradation in character consistency and stability across multiple turns; in comparison, we observe that FLUX.1 Kontext better preserves objects and characters, leading to greater robustness in iterative workflows. The model achieves competitive performance with current state-of-the-art systems while delivering significantly faster generation times, enabling interactive applications and rapid prototyping workflows. To validate these improvements, we introduce KontextBench, a comprehensive benchmark with 1026 image-prompt pairs covering five task categories: local editing, global editing, character reference, style reference, and text editing. Detailed evaluations show the superior performance of FLUX.1 Kontext in terms of both single-turn quality and multi-turn consistency, setting new standards for unified image processing models.

Black Forest Labs, Stephen Batifol, Andreas Blattmann, Frederic Boesel, Saksham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, Kyle Lacey, Yam Levi, Cheng Li, Dominik Lorenz, Jonas Müller, Dustin Podell, Robin Rombach, Harry Saini, Axel Sauer, Luke Smith • 2025

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
| Text-to-Image Generation | GenEval | Overall Score | 82 | 704 |
| Image Generation | ImageNet 256x256 | IS | 348.9 | 517 |
| Text-to-Image Generation | GenEval | Overall Score | 66 | 517 |
| Text-to-Image Generation | DPG-Bench | Overall Score | 84 | 451 |
| Text-to-Image Generation | GenEval | GenEval Score | 82 | 442 |
| Text-to-Image Generation | GenEval | Overall Score | 0.82 | 277 |
| Image Editing | ImgEdit-Bench | Overall Score | 4 | 224 |
| Text-to-Image Generation | GenEval | Overall Score | 66 | 218 |
| Image Editing | PIE-Bench | PSNR | 34.91 | 215 |
| Text-to-Image Generation | T2I-CompBench | Shape Fidelity | 51.12 | 185 |
Showing 10 of 420 rows