Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EMTD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-driven half-body human video generationEMTD 1.0 (evaluation set)
FID49.33
14
Talking avatar video generationEMTD (test)
FID59.87
10
Head-Oriented Image-to-Video GenerationEMTD
IQA2.31
6
Audio-driven video generationEMTD (test)
FID15.66
6
Talking Head GenerationEMTD
Sync-C8.61
4
Audio-to-video synthesisEMTD
LSE-C7.05
3
Showing 6 of 6 rows