Where Concept Erasure Should Occur: Concept-Layer Alignment in Text-to-Video Diffusion Models
About
Text-to-video diffusion transformers encode semantic information unevenly across model depth, which constrains effective concept erasure. We identify a representational bottleneck, termed concept-layer topological alignment, under which target concepts exhibit higher separability at certain representational depths. Outside these depths, concept and non-target signals remain strongly entangled, limiting the effectiveness of depth-specific erasure. This observation reframes concept erasure as the problem of identifying representational depths where concept-non-target separation naturally emerges. Motivated by this structural constraint, we introduce CLEAR, a separability-driven optimization framework for concept erasure that explicitly enforces concept-layer alignment. CLEAR operationalizes this principle by formulating layer selection as an optimization problem over concept-non-target separability, rather than relying on layer-agnostic or heuristic choices. To enable this, we introduce a separability-aware objective that favors layers exhibiting stronger concept-non-target separation. Experiments on large-scale text-to-video diffusion models demonstrate that enforcing concept--layer alignment leads to more precise concept suppression while preserving overall generative quality.
Related benchmarks
| Task | Dataset | Result | Rank | |
|---|---|---|---|---|
| Nudity Erasure | Ring-a-Bell | Generation Rate25.6 | 17 | |
| Celebrity identity erasure | CogVideoX-2B Celebrity Identities (test) | Identity Similarity Score (Merkel)0.0819 | 6 | |
| Concept Erasure | CogVideoX-2B Nudity Concepts | Generative Rate14.63 | 6 | |
| Artist Style Erasure | Artist Style Pablo Picasso CogVideoX-2B (test) | VCLIPe0.1915 | 5 | |
| Artist Style Erasure | Artist Style Andy Warhol CogVideoX-2B (test) | VCLIPe Score0.1449 | 5 | |
| Artist Style Erasure | Artist Style Erasure Rembrandt | VCLIPe Score0.0593 | 5 | |
| Artist Style Erasure | Artist Style Erasure Andy Warhol | VCLIPe0.0669 | 5 | |
| Artist Style Erasure | Artist Style Rembrandt CogVideoX-2B (test) | VCLIPe Score6.39 | 5 | |
| Celebrity identity erasure | Wan 5B 2.2 | Merkel Identity Erasure Score-0.0204 | 5 | |
| Artist Style Erasure | Artist Style Erasure Pablo Picasso | VCLIPe0.1397 | 5 |