Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GPT

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak DefenseMiniGPT-4
Attack Success Rate (ASR)7.32
36
JailbreakingGPT-4o
ASR0.99
19
Adversarial AttackGPT-4o
ASR0.6
14
JailbreakingGPT 5.1
ASR96.5
13
Targeted Adversarial AttackGPT 5.4
ASR0
12
Language ModelingGPT Small (val)
Validation Perplexity27.95
12
AI-generated table detectionGPT 5.2 (External Holdout)
AUROC88.3
12
End-to-end inference tuningGPT
Tuning Time (s)23.8
9
Transfer AttackGPT-5
Attack Success Rate18.67
9
Transfer AttackGPT 4.1
Attack Success Rate (ASR)4.33
9
Targeted AttackGPT closed-source standard MLLMs 5.4
ASR3.8
8
Targeted AttackGPT closed-source standard MLLMs 5.2
ASR3.3
8
Targeted AttackGPT-4o 5.2 (test)
Attack Success Rate (ASR)46.9
8
Language ModelingGPT Pre-training (val)
Validation Perplexity19.98
8
Detection of paraphrased textGPT Paraphrased 4.1
ROC AUC (1% FPR)0.3977
8
Language ModelingGPT nano (val)
Validation Loss3.25
5
JailbreakGPT 4.1 8 July 2025 release
ASR99.8
5
Text-to-Video GenerationGPT-G
Semantic Objective76.8
4
Machine-generated text detectionGPT-3.5 (test)
Accuracy99.14
4
Summary Similarity EvaluationGPT generated summaries 5.1
BERTScore-F186.5
3
Text GenerationMiniGPT-4
BLEU-148.1
3
AI-generated paper detectionGPT Clean Holdout 5.2 (test)
AUROC0.8857
1
Showing 22 of 22 rows