| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Image-to-Text Adversarial Attack | Evaluation set | ASR: 97.4 | 48 |
| Targeted Adversarial Attack | Evaluation set (test) | Attack Success Rate (ASR): 58.5 | 48 |
| Camera Model Identification | Evaluation set | Accuracy: 93.61 | 15 |
| Lyrics-to-vocals | Evaluation set without audio prompt (test) | Musicality: 3.98 | 7 |
| Language Modeling | Evaluation Set | Loss: 1.844 | 4 |