Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text-to-Audio Generation on RiTTA Count OOD (test)
Loading...
2.18
KL Divergence
TangoFlux-RAG
2.1548
2.3249
2.495
2.6651
Nov 2, 2025
KL Divergence
FD (Fréchet Distance)
FAD (Fréchet Audio Distance)
IS (Inception Score)
CLAP Score
Updated 4d ago
Evaluation Results
Method
Method
Links
KL Divergence
FD (Fréchet Distance)
FAD (Fréchet Audio Distance)
IS (Inception Score)
CLAP Score
TangoFlux-RAG
base_model=TangoFlux
2025.11
2.18
37.7
5.1
7.3
43.7
TangoFlux
2025.11
2.22
46.8
7.3
7
43.3
AudioLDM2-RAG
base_model=AudioLDM2
2025.11
2.71
35.2
4.4
8.5
34.2
AudioLDM2
2025.11
2.81
38.5
7.7
7.4
29
Feedback
Search any
task
Search any
task