Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

AudioCaps, Clotho, and MECAT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Text Retrieval (T2T)AudioCaps, Clotho, and MECAT Mean
R@146.46
13
Text-to-Audio Retrieval (T2A)AudioCaps, Clotho, and MECAT Mean
Recall@10.221
13
Showing 2 of 2 rows