Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

MusicCaps

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Music GenerationMusicCaps (evaluation set)
FAD2.18
20
Text-to-Music GenerationMusicCaps
KLD1.01
11
Music GenerationMusicCaps
FAD1.12
11
Music GenerationMusicCaps (test)
FAD1.12
10
Text-to-Audio GenerationMusicCaps
FDopenl3108.69
10
Music GenerationMusicCaps (full)
Aes8.26
8
Text-to-Music GenerationMusicCaps unbalanced (test)
FAD2
7
Music ReconstructionMusicCaps
VISQOL Score4.06
6
Text-to-Music GenerationMusicCaps genre-balanced (test)
T2M-QLT85.7
6
Music CaptioningMusicCaps (test)
Relevance5.77
5
Music GenerationMusicCaps 2023 (test)
FADVGG2.134
5
Music GenerationMusicCaps 25s-long clips
FD (OpenL3)85.2023
4
Music GenerationMusicCaps 10s-long clips
FD (OpenL3)74.4559
4
Audio CaptioningMusicCaps (MC) non-vocal
SBERT Similarity0.478
4
Text-to-Audio GenerationMusicCaps
Structure: Intro92.1
4
Text-to-Music RetrievalMusicCaps
R@16.69
4
Music-to-Text RetrievalMusicCaps
R@16.37
4
Audio-Visual RetrievalMusicCaps (test)
Recall@120.4
2
Music UnderstandingMusicCaps
CLAP Score0.16
2
Text-to-Music GenerationMusicCaps (test)
REL (General Audience)4.09
1
Showing 20 of 20 rows