Z-Erase: Enabling Concept Erasure in Single-Stream Diffusion Transformers
About
Concept erasure serves as a vital safety mechanism for removing unwanted concepts from text-to-image (T2I) models. While extensively studied in U-Net and dual-stream architectures (e.g., Flux), this task remains under-explored in the recent emerging paradigm of single-stream diffusion transformers (e.g., Z-Image). In this new paradigm, text and image tokens are processed as a single unified sequence via shared parameters. Consequently, directly applying prior erasure methods typically leads to generation collapse. To bridge this gap, we introduce Z-Erase, the first concept erasure method tailored for single-stream T2I models. To guarantee stable image generation, Z-Erase first proposes a Stream Disentangled Concept Erasure Framework that decouples updates and enables existing methods on single-stream models. Subsequently, within this framework, we introduce Lagrangian-Guided Adaptive Erasure Modulation, a constrained algorithm that further balances the sensitive erasure-preservation trade-off. Moreover, we provide a rigorous convergence analysis proving that Z-Erase can converge to a Pareto stationary point. Experiments demonstrate that Z-Erase successfully overcomes the generation collapse issue, achieving state-of-the-art performance across a wide range of tasks.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Nudity Erasure | I2P | Total Count161 | 38 | |
| Utility Preservation | MS-COCO 10k | FID27.62 | 22 | |
| Utility Preservation | COCO-10K (val) | FID26.46 | 20 | |
| Concept Erasure | Curated Artistic Style | ACCe26.2 | 14 | |
| Violence Erasure | I2P | Total324 | 12 | |
| Concept Erasure | Entity Category (e.g., church) | Accuracy23.1 | 10 | |
| Celebrity Erasure | CelebA 100 identities | ACCe23.52 | 10 | |
| Concept Erasure | Abstraction Category color | ACCe24.3 | 10 | |
| Concept Erasure | Ring-A-Bell (285 prompts) | Attack Success Rate (w/o ATTACK)10.52 | 5 | |
| Concept Erasure | Curated Entity | Accuracy (e)24.3 | 4 |