| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreak Detection | JBShield evaluation suite GCG attack on Llama-3-8B | Detection Accuracy: 100 | 4 |
| Jailbreak Detection | JBShield evaluation suite Base64 attack on Llama-3-8B | Detection Accuracy: 100 | 2 |
| Jailbreak Detection | JBShield evaluation suite Zulu attack on Llama-3-8B | Accuracy: 100 | 2 |
| Jailbreak Detection | JBShield Puzzler attack on Llama-3-8B | Detection Accuracy: 100 | 2 |
| Jailbreak Detection | JBShield evaluation suite DrAttack attack on Llama-3-8B | Detection Accuracy: 100 | 2 |
| Jailbreak Detection | JBShield PAIR attack on Llama-3-8B | Detection Accuracy: 77 | 2 |
| Jailbreak Detection | JBShield AutoDAN attack on Llama-3-8B | Detection Accuracy: 97 | 2 |
| Jailbreak Detection | JBShield evaluation suite IJP attack on Llama-3-8B | Detection Accuracy: 96 | 2 |
| Jailbreak Attack Success Rate | JBShield (test) | Attack Success Rate (ASR): 95 | 1 |