
Speaker

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Text-prompted separation | Speaker | SAJ 1.85 | 9 |
| Visual-prompted audio separation | Speaker | IB Score 0.24 | 5 |
| Avatar Generation | Speaker-5M (test) | PSNR 22.09 | 5 |