
MECAT

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Text-to-text retrieval | MECAT | Recall@1: 0.2545 | 13 |
| Text-to-Audio Retrieval | MECAT (test) | Recall@1: 8.02 | 13 |
| Mixed-Audio Generation | MECAT Speech + Audio (S0A) | FADVGG: 30.38 | 10 |
| Audio Generation | MECAT S00 | FADVGG: 26.74 | 5 |
| Audio Generation | MECAT 0M0 | FADVGG: 21.68 | 5 |
| Audio Generation | MECAT 00A | FADVGG: 51.42 | 5 |
| Mixed-Audio Generation | MECAT Speech + Music (SM0) | FADVGG: 19.83 | 5 |
| Mixed-Audio Generation | MECAT 0MA Music + Audio | FADVGG Score: 3.25 | 5 |
| Audio Captioning | MECAT | FENSE Content Long Score: 60.11 | 3 |