T2VUnlearning: A Concept Erasing Method for Text-to-Video Diffusion Models
About
Recent advances in text-to-video (T2V) diffusion models have significantly enhanced the quality of generated videos. However, their capability to produce explicit or harmful content introduces new challenges related to misuse and potential rights violations. To address this emerging threat, we propose unlearning-based concept erasing as a solution. First, we adopt negatively guided velocity-prediction fine-tuning and enhance it with prompt augmentation to ensure robustness against prompts refined by large language models (LLMs). Second, to achieve precise unlearning, we incorporate mask-based localization regularization and a concept-preservation regularizer that retains the model's ability to generate non-target concepts. Extensive experiments demonstrate that our method effectively erases a specific concept while preserving the model's generation capability for all other concepts, outperforming existing methods. We provide the unlearned models at [https://github.com/VDIGPKU/T2VUnlearning.git](https://github.com/VDIGPKU/T2VUnlearning.git).
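To make the two ingredients above concrete, below is a minimal PyTorch sketch of how an erasure objective of this kind can be assembled. It is an illustration under our own assumptions, not the paper's released code: the model call signature `model(x_t, t, cond)`, the guidance scale `eta`, and the pre-computed localization `mask` are hypothetical placeholders, and the prompt-augmentation step (training on LLM-rephrased variants of the target prompt) is omitted for brevity.

```python
import torch
import torch.nn.functional as F

def unlearning_losses(
    student,      # trainable copy of the T2V model; called as model(x_t, t, cond)
    teacher,      # frozen original model with the same interface
    x_t,          # noised video latents, e.g. [B, C, F, H, W]
    t,            # diffusion timesteps, [B]
    c_target,     # text embedding of the concept to erase
    c_neutral,    # text embedding of a neutral / unconditional prompt
    c_preserve,   # text embedding of an unrelated prompt to keep intact
    mask,         # [B, 1, F, H, W] mask (1 = concept region), assumed given
    eta=1.0,      # negative-guidance scale (hypothetical hyperparameter)
):
    """Return (erasure loss, preservation loss) for one training step."""
    with torch.no_grad():
        v_tgt = teacher(x_t, t, c_target)    # teacher velocity toward the concept
        v_neu = teacher(x_t, t, c_neutral)   # teacher concept-free velocity
        # Negatively guided target: push the prediction away from the concept
        # direction, mirroring classifier-free guidance with a negated term.
        v_erase = v_neu - eta * (v_tgt - v_neu)

    # Mask-based localization: penalize deviations only inside the
    # concept-relevant region so unrelated content is left untouched.
    v_student = student(x_t, t, c_target)
    loss_erase = F.mse_loss(mask * v_student, mask * v_erase)

    # Concept preservation: match the frozen teacher on non-target prompts.
    with torch.no_grad():
        v_keep = teacher(x_t, t, c_preserve)
    loss_preserve = F.mse_loss(student(x_t, t, c_preserve), v_keep)

    return loss_erase, loss_preserve
```

In this reading, the negatively guided target steers the fine-tuned model's velocity prediction away from the erased concept, the mask confines that change to concept-relevant regions, and the preservation term anchors the model to its frozen copy on unrelated prompts; the two losses would be combined with a weighting hyperparameter during fine-tuning.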
Related benchmarks
| Task | Dataset | Metric | Value | Rank |
|---|---|---|---|---|
| Video Nudity Erasure | Ring-a-Bell | Nudity Rate | 6.97 | 6 |
| Video Nudity Erasure | Gen | Nudity Rate | 19.73 | 6 |
| Video Generation Quality | VBench | Object Class Acc. | 87 | 6 |
| Object Erasure | ImageNet | ESR (1% Erasure) | 92.38 | 5 |