Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Audio Captioning on AudioCaps (MLLM/Human Evaluation)

0.75MWR-S (MLLM)

SoundAtlas

0.34440.44970.5550.6603Jan 6, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.01
0.750.580.710.69
2026.01
0.390.410.310.26
0.360.510.460.55