Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MotionCFG: Boosting Motion Dynamics via Stochastic Concept Perturbation

About

Despite recent advances in Text-to-Video (T2V) synthesis, generating high-fidelity and dynamic motion remains a significant challenge. Existing methods primarily rely on Classifier-Free Guidance (CFG), often with explicit negative prompts (e.g. "static", "blurry"), to suppress undesired artifacts. However, such explicit negations frequently introduce unintended semantic bias and distort object integrity; a phenomenon we define as Content-Motion Drift. To address this, we propose MotionCFG, a framework that enhances motion dynamics by contrasting a target concept with its noise-perturbed counterparts. Specifically, by injecting Gaussian noise into the concept embeddings, MotionCFG creates localized negative anchors that encapsulate a broad complementary space of sub-optimal motion variations. Unlike explicit negations, this approach facilitates implicit hard negative mining without shifting the global semantic identity, allowing for a focused refinement of temporal details. Combined with a piecewise guidance schedule that confines intervention to the early denoising steps, MotionCFG consistently improves motion dynamics across state-of-the-art T2V frameworks with negligible computational overhead and minimal compromise in visual quality. Additionally, we demonstrate that this noise-induced contrastive mechanism is effective not only for sharpening motion trajectories but also for steering complex, non-linear concepts such as precise object numerosity, which are typically difficult to modulate via standard text-based guidance.

Byungjun Kim, Soobin Um, Jong Chul Ye• 2026

Related benchmarks

TaskDatasetResultRank
Text-to-Video GenerationGemini-generated prompts
CLIPScore0.255
12
3D Motion GenerationUser Study--
10
Text-to-Video GenerationT2V-CompBench (test)
CLIPScore0.2593
8
Object CountingCountVid
MAE2.3952
5
CountingUser Study
Preference Rate (Visual Quality & Text Alignment) - Ours77.42
1
Showing 5 of 5 rows

Other info

Follow for update