| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Word Embedding Accuracy | English dataset | Accuracy 85.8 | 18 |
| Misinformation Detection | English Dataset | Macro F1 76.08 | 18 |
| Text Classification | English Dataset | Accuracy 0.9148 | 11 |
| Jailbreak Safety Evaluation | English dataset Multi-Image | StrongREJECT (Perturbed) 14 | 6 |
| Jailbreak Safety Evaluation | English dataset Single-Image | StrongREJECT (Perturbed) 10 | 6 |
| Jailbreak Safety Evaluation | English dataset Text | StrongREJECT Rate 0.01 | 6 |
| Speech Reconstruction | English dataset 2 kHz sampling | LSD 1.01 | 5 |
| Speech Reconstruction | English dataset 1 kHz sampling | LSD 1.19 | 5 |
| Speech Reconstruction | English dataset 500 Hz sampling | LSD 1.29 | 5 |