Empty SPACE: Cross-Attention Sparsity for Concept Erasure in Diffusion Models
About
Erasing specific concepts from text-to-image diffusion models is essential for avoiding the generation of copyrighted and explicit content. Closed-form concept erasure methods offer a fast alternative to backpropagation-based techniques, but they become less effective when scaling from smaller models such as Stable Diffusion 1.5 to larger models like Stable Diffusion XL. To maintain erasure effectiveness in these larger-scale architectures, we propose SParse cross-Attention-based Concept Erasure (SPACE). SPACE iteratively modifies the cross-attention parameters of a model with a closed-form update that jointly induces sparsity and erases target concepts. By concentrating the concept mapping to a lower-dimensional subspace, SPACE achieves superior erasure efficacy compared to dense baselines. Extensive experimental results show improvements in erasure effectiveness and robustness against adversarial prompts. Furthermore, SPACE achieves 80\%-90\% cross-attention sparsity, reducing the storage requirements for saving the modified parameters by 70\%, demonstrating its memory efficiency.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Nudity Erasure | I2P | Total Count16 | 44 | |
| Nudity Erasure | MMA-Diff tar 1.5 (test) | Nudity Generation Rate5.2 | 26 | |
| Nudity Erasure | I2P 1.5 (test) | Nudity Generation Rate2.8 | 13 | |
| Nudity Erasure | MMA-Diff adv. 1.5 (test) | Nudity Generation Rate6.3 | 13 | |
| Nudity Erasure | Ring-A-Bell 1.5 (test) | Nudity Generation Rate24.5 | 13 | |
| Style Erasure | SDXL Erasing Van Gogh | CS (Van Gogh)30 | 13 | |
| Concept Erasure | I2P | Breast (F) Erasure Rate13 | 7 | |
| Style Erasure | SDXL Erasing Van Gogh and Monet | CS (Van Gogh)30.09 | 6 |