
Text-to-Speech on Spark-TTS generated audio

FAD (VGGish): 0.2057

None

[Chart: y-axis 0.199904 to 0.317273; x-axis May 25, 2026]
Updated 7d ago

Evaluation Results

Method   Links
2026.05  0.2057  0.0401  3.37  2.94  0.97  0.18
2026.05  0.2221  0.0484  3.30  2.97  0.98  0.12
2026.05  0.3506  0.0472  3.46  2.96  0.99  0.24