Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Song Describer

Benchmarks

Task NameDataset NameSOTA ResultTrend
Context Length EstimationSong Describer
Context Length (s)993
10
Audio ReconstructionSong Describer
L/R Mel0.9586
10
Audio CaptioningSong Describer (SD)
SBERT Similarity0.469
4
Music GenerationSong Describer Dataset no-singing 2m
Stereo Correctness96
3
Showing 4 of 4 rows