FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

About

We present evaluation results for FLUX.1 Kontext, a generative flow matching model that unifies image generation and editing. The model generates novel output views by incorporating semantic context from text and image inputs. Using a simple sequence concatenation approach, FLUX.1 Kontext handles both local editing and generative in-context tasks within a single unified architecture. Compared to current editing models that exhibit degradation in character consistency and stability across multiple turns, we observe that FLUX.1 Kontext improved preservation of objects and characters, leading to greater robustness in iterative workflows. The model achieves competitive performance with current state-of-the-art systems while delivering significantly faster generation times, enabling interactive applications and rapid prototyping workflows. To validate these improvements, we introduce KontextBench, a comprehensive benchmark with 1026 image-prompt pairs covering five task categories: local editing, global editing, character reference, style reference and text editing. Detailed evaluations show the superior performance of FLUX.1 Kontext in terms of both single-turn quality and multi-turn consistency, setting new standards for unified image processing models.

Black Forest Labs, Stephen Batifol, Andreas Blattmann, Frederic Boesel, Saksham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, Kyle Lacey, Yam Levi, Cheng Li, Dominik Lorenz, Jonas M\"uller, Dustin Podell, Robin Rombach, Harry Saini, Axel Sauer, Luke Smith• 2025

Related benchmarks

Task	Dataset	Result
Text-to-Image Generation	GenEval	Overall Score82	914
Image Generation	ImageNet 256x256	IS348.9	606
Text-to-Image Generation	GenEval	Overall Score67	581
Text-to-Image Generation	DPG-Bench	Overall Score84	510
Text-to-Image Generation	GenEval	GenEval Score82	459
Text-to-Image Generation	GenEval	Overall Score0.82	318
Image Editing	PIE-Bench	PSNR34.91	257
Image Editing	ImgEdit-Bench	Overall Score4	256
Text-to-Image Generation	GenEval	Overall Score66	218
Text-to-Image Generation	T2I-CompBench	Shape Fidelity51.12	185

Showing 10 of 485 rows

...

Other info

Follow for update

@wizwand_team Discord