
Song Describer

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Context Length Estimation | Song Describer | Context Length (s): 993 | 10 |
| Audio Reconstruction | Song Describer | L/R Mel: 0.9586 | 10 |
| Audio Captioning | Song Describer (SD) | SBERT Similarity: 0.469 | 4 |
| Music Generation | Song Describer Dataset no-singing 2m | Stereo Correctness: 96 | 3 |