Mixed-Audio Generation on MECAT Speech + Music (SM0)
[Chart: FADVGG by method. Highlighted point: MusicGen, 19.83 (May 27, 2026). Selectable metrics: FADVGG, FDPANNS, KL Divergence, CLAP, GLAP, WER (%), UTMOSv2.]
Updated 6d ago
Evaluation Results
| Method | Date | FADVGG | FDPANNS | KL Divergence | CLAP | GLAP | WER (%) | UTMOSv2 |
|---|---|---|---|---|---|---|---|---|
| MusicGen | 2026.05 | 19.83 | 57.28 | 2.38 | 13.8 | -4.99 | 99.99 | 1.65 |
| TangoFlux | 2026.05 | 11.57 | 40.77 | 1.11 | 33.7 | 1.85 | 99.44 | 1.4 |
| Qwen3-TTS | 2026.05 | 10.2 | 63.15 | 2.16 | 18.6 | -3.78 | 15.6 | 3.49 |
| Expert-Pipeline | 2026.05 | 9.55 | 24.1 | 0.69 | 30.9 | 5.36 | 24.31 | 2.26 |
| Dasheng AudioGen | 2026.05 | 1.7 | 6.69 | 0.33 | 32.7 | 9.8 | 21.96 | 2.72 |