Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Z-Erase: Enabling Concept Erasure in Single-Stream Diffusion Transformers

About

Concept erasure serves as a vital safety mechanism for removing unwanted concepts from text-to-image (T2I) models. While extensively studied in U-Net and dual-stream architectures (e.g., Flux), this task remains under-explored in the recent emerging paradigm of single-stream diffusion transformers (e.g., Z-Image). In this new paradigm, text and image tokens are processed as a single unified sequence via shared parameters. Consequently, directly applying prior erasure methods typically leads to generation collapse. To bridge this gap, we introduce Z-Erase, the first concept erasure method tailored for single-stream T2I models. To guarantee stable image generation, Z-Erase first proposes a Stream Disentangled Concept Erasure Framework that decouples updates and enables existing methods on single-stream models. Subsequently, within this framework, we introduce Lagrangian-Guided Adaptive Erasure Modulation, a constrained algorithm that further balances the sensitive erasure-preservation trade-off. Moreover, we provide a rigorous convergence analysis proving that Z-Erase can converge to a Pareto stationary point. Experiments demonstrate that Z-Erase successfully overcomes the generation collapse issue, achieving state-of-the-art performance across a wide range of tasks.

Nanxiang Jiang, Zhaoxin Fan, Baisen Wang, Daiheng Gao, Junhang Cheng, Jifeng Guo, Yalan Qin, Yeying Jin, Hongwei Zheng, Faguo Wu, Wenjun Wu• 2026

Related benchmarks

TaskDatasetResultRank
Nudity ErasureI2P
Total Count161
38
Utility PreservationMS-COCO 10k
FID27.62
22
Utility PreservationCOCO-10K (val)
FID26.46
20
Concept ErasureCurated Artistic Style
ACCe26.2
14
Violence ErasureI2P
Total324
12
Concept ErasureEntity Category (e.g., church)
Accuracy23.1
10
Celebrity ErasureCelebA 100 identities
ACCe23.52
10
Concept ErasureAbstraction Category color
ACCe24.3
10
Concept ErasureRing-A-Bell (285 prompts)
Attack Success Rate (w/o ATTACK)10.52
5
Concept ErasureCurated Entity
Accuracy (e)24.3
4
Showing 10 of 12 rows

Other info

Follow for update