Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Downstream Audio Generation

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-To-AudioDownstream Audio Generation TTA
FAD1.987
8
Text-To-MusicDownstream Audio Generation (TTM)
FAD3.366
8
Text-To-SpeechDownstream Audio Generation TTS
WER3.03
8
Showing 3 of 3 rows