Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MusicCaps

Benchmarks

Task NameDataset NameSOTA ResultTrend
Audio Watermarking AttributionMusicCaps
Accuracy (Att.) (%)100
352
Audio Watermark AttributionMusicCaps (test)
Attribution Accuracy100
85
Audio Watermark DetectionMusicCaps (test)
Detection Accuracy100
85
Audio Watermark DetectionMusicCaps balanced (val)
Accuracy100
85
Text-to-Music GenerationMusicCaps (evaluation set)
FAD2.18
20
Music GenerationMusicCaps (test)
FAD0
16
Music-to-Text RetrievalMusicCaps
R@124.6
12
Text-to-Music GenerationMusicCaps
KLD1.01
11
Music GenerationMusicCaps
FAD1.12
11
Text-to-Audio GenerationMusicCaps
FDopenl3108.69
10
Audio CaptioningMusicCaps
Captioning Score23.33
8
Music GenerationMusicCaps (full)
Aes8.26
8
Music CaptioningMusicCaps (test)
METEOR23.4
8
Text-to-Music GenerationMusicCaps unbalanced (test)
FAD2
7
Music ReconstructionMusicCaps
VISQOL Score4.06
6
Text-to-Music GenerationMusicCaps genre-balanced (test)
T2M-QLT85.7
6
Music GenerationMusicCaps 2023 (test)
FADVGG2.134
5
Audio Generation QualityMusicCaps MusicGen 32kHz (val)
FAD (VGGish)0.247
4
Music GenerationMusicCaps 25s-long clips
FD (OpenL3)85.2023
4
Music GenerationMusicCaps 10s-long clips
FD (OpenL3)74.4559
4
Audio CaptioningMusicCaps (MC) non-vocal
SBERT Similarity0.478
4
Text-to-Audio GenerationMusicCaps
Structure: Intro92.1
4
Text-to-Music RetrievalMusicCaps
R@16.69
4
Music RetrievalMusicCaps
Precision98
3
Text-to-Music GenerationMusicCaps
CLAP Similarity (Benign, User Question)0.33
3
Showing 25 of 28 rows