Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

EMTD

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio-driven half-body human video generationEMTD 1.0 (evaluation set)
FID49.33
14
Talking Head GenerationEMTD
Sync-C5.596
10
Talking avatar video generationEMTD (test)
FID59.87
10
Head-Oriented Image-to-Video GenerationEMTD
IQA2.31
6
Audio-driven video generationEMTD (test)
FID15.66
6
Talking Head GenerationEMTD long-horizon streaming
Sync-C9.593
5
Talking Head GenerationEMTD
Sync-C8.61
4
Audio-to-video synthesisEMTD
LSE-C7.05
3
Showing 8 of 8 rows