Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

UniReason 1.0: A Unified Reasoning Framework for World Knowledge Aligned Image Generation and Editing

About

Unified multimodal models often struggle with complex synthesis tasks that demand deep reasoning, and typically treat text-to-image generation and image editing as isolated capabilities rather than interconnected reasoning steps. To address this, we propose UniReason, a unified framework that harmonizes these two tasks through two complementary reasoning paradigms. We incorporate world knowledge-enhanced textual reasoning into generation to infer implicit knowledge, and leverage editing capabilities for fine-grained editing-like visual refinement to further correct visual errors via self-reflection. This approach unifies generation and editing within a shared architecture, mirroring the human cognitive process of planning followed by refinement. We support this framework by systematically constructing a large-scale reasoning-centric dataset (~300k samples) covering five major knowledge domains (e.g., cultural commonsense, physics, etc.) for textual reasoning, alongside an agent-generated corpus for visual refinement. Extensive experiments demonstrate that UniReason achieves advanced performance on reasoning-intensive benchmarks such as WISE, KrisBench and UniREditBench, while maintaining superior general synthesis capabilities.

Dianyi Wang, Chaofan Ma, Feng Han, Size Wu, Wei Song, Yibin Wang, Zhixiong Zhang, Tianhang Wang, Siyuan Wang, Zhongyu Wei, Jiaqi Wang• 2026

Related benchmarks

TaskDatasetResultRank
Text-to-Image GenerationGenEval
Overall Score90
467
Text-to-Image GenerationDPG
Overall Score86.21
131
Image EditingGEdit-Bench English
G_O (Overall Quality)6.94
73
Text-to-Image GenerationWISE (test)
Overall Score78
32
Knowledge-intensive image editingKrisBench (test)
Factual Accuracy70.67
8
Knowledge-intensive image editingUniREditBench (test)
Real World Score74.82
7
Showing 6 of 6 rows

Other info

Follow for update