LURE: Latent Space Unblocking for Multi-Concept Reawakening in Diffusion Models
About
Concept erasure aims to suppress sensitive content in diffusion models, but recent studies show that erased concepts can still be reawakened, revealing vulnerabilities in erasure methods. Existing reawakening methods mainly rely on prompt-level optimization to manipulate sampling trajectories, neglecting other generative factors, which limits a comprehensive understanding of the underlying dynamics. In this paper, we model the generation process as an implicit function to enable a comprehensive theoretical analysis of multiple factors, including text conditions, model parameters, and latent states. We theoretically show that perturbing each factor can reawaken erased concepts. Building on this insight, we propose a novel concept reawakening method: Latent space Unblocking for concept REawakening (LURE), which reawakens erased concepts by reconstructing the latent space and guiding the sampling trajectory. Specifically, our semantic re-binding mechanism reconstructs the latent space by aligning denoising predictions with target distributions to reestablish severed text-visual associations. However, in multi-concept scenarios, naive reconstruction can cause gradient conflicts and feature entanglement. To address this, we introduce Gradient Field Orthogonalization, which enforces feature orthogonality to prevent mutual interference. Additionally, our Latent Semantic Identification-Guided Sampling (LSIS) ensures stability of the reawakening process via posterior density verification. Extensive experiments demonstrate that LURE enables simultaneous, high-fidelity reawakening of multiple erased concepts across diverse erasure tasks and methods.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Concept Reawakening | ImageNette 4-concept erasure: English Springer, French Horn, Golf Ball, Parachute SD v1.4 base (test) | Accuracy87.03 | 7 | |
| Concept Reawakening | Unsafe Content | Blood27.14 | 7 | |
| Concept Reawakening | ImageNette Parachute | CLIP Score33.16 | 7 | |
| Concept Reawakening | ImageNette Tench | CLIP Score34.01 | 7 | |
| Semantic Alignment | Celebrity Identities | Taylor Swift Alignment Score30.11 | 7 | |
| Semantic Alignment | Intellectual Property Characters | Snoopy Alignment Score33.86 | 7 | |
| Concept Reawakening | ImageNette English Springer | CLIP Score32.9 | 7 | |
| Concept Reawakening | ImageNette 2-concept erasure: English Springer, Golf Ball SD v1.4 base (test) | Accuracy87.83 | 6 |