Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Where Concept Erasure Should Occur: Concept-Layer Alignment in Text-to-Video Diffusion Models

About

Text-to-video diffusion transformers encode semantic information unevenly across model depth, which constrains effective concept erasure. We identify a representational bottleneck, termed concept-layer topological alignment, under which target concepts exhibit higher separability at certain representational depths. Outside these depths, concept and non-target signals remain strongly entangled, limiting the effectiveness of depth-specific erasure. This observation reframes concept erasure as the problem of identifying representational depths where concept-non-target separation naturally emerges. Motivated by this structural constraint, we introduce CLEAR, a separability-driven optimization framework for concept erasure that explicitly enforces concept-layer alignment. CLEAR operationalizes this principle by formulating layer selection as an optimization problem over concept-non-target separability, rather than relying on layer-agnostic or heuristic choices. To enable this, we introduce a separability-aware objective that favors layers exhibiting stronger concept-non-target separation. Experiments on large-scale text-to-video diffusion models demonstrate that enforcing concept--layer alignment leads to more precise concept suppression while preserving overall generative quality.

Yiwei Xie, Ping Liu, Zheng Zhang• 2026

Related benchmarks

TaskDatasetResultRank
Nudity ErasureRing-a-Bell
Generation Rate25.6
17
Celebrity identity erasureCogVideoX-2B Celebrity Identities (test)
Identity Similarity Score (Merkel)0.0819
6
Concept ErasureCogVideoX-2B Nudity Concepts
Generative Rate14.63
6
Artist Style ErasureArtist Style Pablo Picasso CogVideoX-2B (test)
VCLIPe0.1915
5
Artist Style ErasureArtist Style Andy Warhol CogVideoX-2B (test)
VCLIPe Score0.1449
5
Artist Style ErasureArtist Style Erasure Rembrandt
VCLIPe Score0.0593
5
Artist Style ErasureArtist Style Erasure Andy Warhol
VCLIPe0.0669
5
Artist Style ErasureArtist Style Rembrandt CogVideoX-2B (test)
VCLIPe Score6.39
5
Celebrity identity erasureWan 5B 2.2
Merkel Identity Erasure Score-0.0204
5
Artist Style ErasureArtist Style Erasure Pablo Picasso
VCLIPe0.1397
5
Showing 10 of 17 rows

Other info

Follow for update