Evaluation set

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Image-to-Text Adversarial Attack | Evaluation set | ASR: 97.4 | 48 |
| Targeted Adversarial Attack | Evaluation set (test) | Attack Success Rate (ASR): 58.5 | 48 |
| Camera Model Identification | Evaluation set | Accuracy: 93.61 | 15 |
| Lyrics-to-vocals | Evaluation set without audio prompt (test) | Musicality: 3.98 | 7 |
| Language Modeling | Evaluation Set | Loss: 1.844 | 4 |