Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

TTS

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multilingual Speech GenerationTTS multilingual (test)
WER0.447
60
Spoken Instruction FollowingTTS Unseen (test)
LC Win Rate77
7
Audio GenerationTTS 80-ms audio segments (test)
Latency (ms)4.4
3
Showing 3 of 3 rows