Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text-to-Sound Generation on Clotho zero-shot (test)
Loading...
2.13
FAD
Make-an-Audio 2
1.9032
3.4341
4.965
6.4959
Oct 1, 2023
FAD
KL
OVL
REL
Updated 4d ago
Evaluation Results
Method
Method
Links
FAD
KL
OVL
REL
Make-an-Audio 2
Training Data (Hours)=...
2023.10
2.13
2.49
61.52
69.9
AudioGen
Training Data (Hours)=4k
2023.10
2.55
2.5
63.84
72.12
UniAudio
Training Data (Hours)=7k
2023.10
3.12
2.57
61.9
66.1
Tango
Training Data (Hours)=...
2023.10
3.61
2.59
66.2
68.57
AudioLDM
Training Data (Hours)=9k
2023.10
4.93
2.6
60.95
65.7
Diffsound
Training Data (Hours)=2k
2023.10
7.8
6.53
-
-
Reference
2023.10
-
-
70.47
78.84
Feedback
Search any
task
Search any
task