Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Description-based speech generation on Accent+ (test)
Loading...
0.596
JointCLAP
AUDIOBOX
0.09056
0.22178
0.353
0.48422
Dec 25, 2023
JointCLAP
WER (%)
QMOS
REL
Updated 4d ago
Evaluation Results
Method
Method
Links
JointCLAP
WER (%)
QMOS
REL
AUDIOBOX
2023.12
0.596
2.6
-
-
ground truth
2023.12
0.561
13.5
-
-
VoiceLDM
2023.12
0.235
4.4
-
-
AudioLDM2-SP
2023.12
0.11
23.9
-
-
Feedback
Search any
task
Search any
task