Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Audio Generation benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Audio Generation
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
LJ Speech (test)
Autoregressive flow
LL Score
5.161
20
3mo ago
Speech clean (test)
None
Generation Success Rate
100
18
3mo ago
Speech Noise SNR=-10 (test)
SEEN
Success Rate
80
18
3mo ago
Sound Clean (test)
None
Generation Success Rate
100
18
3mo ago
Sound Noise SNR=-10 (test)
SEEN
Success Rate
78.33
18
3mo ago
Music clean (test)
None
Generation Success Rate
100
18
3mo ago
Music Noise SNR=-10 (test)
SEEN
Generation Success Rate
85
18
3mo ago
LibriTTS (dev)
HiFi-GAN
M-STFT
1.3647
18
3mo ago
AudioSet AAR 20k
TurboQuant-MSE
Minimum LSD
0
15
13d ago
MMAU Speech (Noise)
SEEN
GSR
80
15
3mo ago
MMAU Speech (Clean)
SEEN
GSR
99
15
3mo ago
MMAU Sound (Noise)
SEEN
GSR
78.33
15
3mo ago
MMAU Sound (Clean)
SEEN
GSR
99.67
15
3mo ago
MMAU Music (Noise)
SEEN
GSR
85
15
3mo ago
MMAU Music (Clean)
SEEN
GSR
99
15
3mo ago
Long-Term Audio
MED only
FAD
2.204
9
2mo ago
LJSpeech Short-Term (test)
BemaGANv2
FAD
0.911
9
2mo ago
AudioSet
AG-REPA
FAD
2.56
8
3mo ago
EpicBench T2A 1.0 (test)
T2A-Feedback + DPO
Win Rate EOS
68
8
3mo ago
MAESTRO (test)
Self-attention
FAD (unconditional)
0.131
6
2mo ago
MECAT S00
MusicGen
FADVGG
26.74
5
6d ago
MECAT 0M0
Qwen3-TTS
FADVGG
21.68
5
6d ago
MECAT 00A
Qwen3-TTS
FADVGG
51.42
5
6d ago
Real Recordings
AMAVA
Fréchet Audio Distance (FAD)
1.71
5
1mo ago
HDTF
Narrating For You
FAD
106.43
5
3mo ago
Showing 25 of 37 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs