Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie Dubbing

About

Movie dubbing describes the process of transforming a script into speech that aligns temporally and emotionally with a given movie clip while exemplifying the speaker's voice demonstrated in a short reference audio clip. This task demands the model bridge character performances and complicated prosody structures to build a high-quality video-synchronized dubbing track. The limited scale of movie dubbing datasets, along with the background noise inherent in audio data, hinder the acoustic modeling performance of trained models. To address these issues, we propose an acoustic-prosody disentangled two-stage method to achieve high-quality dubbing generation with precise prosody alignment. First, we propose a prosody-enhanced acoustic pre-training to develop robust acoustic modeling capabilities. Then, we freeze the pre-trained acoustic system and design a disentangled framework to model prosodic text features and dubbing style while maintaining acoustic quality. Additionally, we incorporate an in-domain emotion analysis module to reduce the impact of visual domain shifts across different movies, thereby enhancing emotion-prosody alignment. Extensive experiments show that our method performs favorably against the state-of-the-art models on two primary benchmarks. The demos are available at https://zzdoog.github.io/ProDubber/.

Zhedong Zhang, Liang Li, Chenggang Yan, Chunshan Liu, Anton van den Hengel, Yuankai Qi• 2025

Related benchmarks

TaskDatasetResultRank
DubbingV2C-Animation + Chem + GRID (test)
MCD (DTW)6.6
8
Movie DubbingGRID2V2C
DD (Sync Error)0.5367
6
DubbingV2C-Animation
DD0.5148
6
DubbingChem
DD (Delay)0.4673
6
DubbingGRID
DD0.2551
6
Movie DubbingV2C2Chem
DD0.4649
6
Movie DubbingV2C2GRID
DD0.3146
6
Movie DubbingChem2V2C zero-shot
DD (Synchronization)0.565
6
Movie DubbingChem2GRID zero-shot
DD (Sync Error)0.3209
6
Movie DubbingGRID2Chem zero-shot
DD (Sync Error)0.5781
6
Showing 10 of 10 rows

Other info

Follow for update