
GPT-4o

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Jailbreak Attack | GPT-4o API | ASR: 84 | 16 |
| Targeted Adversarial Attack | GPT-4o | ASR: 860 | 12 |
| Open-ended image description | GPT-4o assisted evaluation | Accuracy: 8.76 | 8 |
| Detection of paraphrased text | GPT-4o-mini Paraphrased | ROC AUC (FPR=1%): 0.4231 | 8 |
| Jailbreak Attack | GPT-4o (test) | ASR: 95 | 6 |
| Denial-of-Service Attack | GPT-4o-mini 2024-07-18 (test) | Response Length: 16,384 | 6 |
| Jailbreak | GPT-4o 29 May 2025 release | ASR: 98.46 | 5 |
| Policy Corruption Evaluation | GPT-4o mini | Compliance Score: 3.53 | 5 |
| Keyword Matching Attack | GPT-4o | KMR (alpha): 73 | 4 |
| Sycophancy-Induced Spiral Dynamics Intervention | GPT-4o high-sycophancy deployment (n = 200, T = 30) | Spiral Rate: 16.5 | 3 |
| Jailbreaking | GPT-4o efficiency analysis | Attack Success Rate (ASR): 65.7 | 3 |