Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GPT-4o

Benchmarks

Task NameDataset NameSOTA ResultTrend
Detection of paraphrased textGPT-4o-mini Paraphrased
ROC AUC (FPR=1%)0.4231
8
Denial-of-Service AttackGPT-4o-mini 2024-07-18 (test)
Response Length16,384
6
Policy Corruption EvaluationGPT-4o mini
Compliance Score3.53
5
Targeted Adversarial AttackGPT-4o
ASR860
4
Showing 4 of 4 rows