GPT-4o

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Jailbreak Attack | GPT-4o API | ASR 84 | 16 |
| Detection of paraphrased text | GPT-4o-mini Paraphrased | ROC AUC (FPR=1%) 0.4231 | 8 |
| Denial-of-Service Attack | GPT-4o-mini 2024-07-18 (test) | Response Length 16,384 | 6 |
| Jailbreak | GPT-4o 29 May 2025 release | ASR 98.46 | 5 |
| Policy Corruption Evaluation | GPT-4o mini | Compliance Score 3.53 | 5 |
| Targeted Adversarial Attack | GPT-4o | ASR 860 | 4 |
| Jailbreaking | GPT-4o efficiency analysis | Attack Success Rate (ASR) 65.7 | 3 |