Share your thoughts, 1 month free Claude Pro on usSee more

Text-to-Speech on Spark-TTS generated audio

0.2057FAD (VGGish)

None

Updated 2mo ago

Evaluation Results

Method	Links
None 2026.05		0.2057	0.0401	3.37	2.94	0.97	0.18
Base 2026.05		0.2221	0.0484	3.3	2.97	0.98	0.12
Ours 2026.05		0.3506	0.0472	3.46	2.96	0.99	0.24