GPT

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Jailbreak Defense | MiniGPT-4 | Attack Success Rate (ASR) 7.32 | 36 |
| Jailbreaking | GPT-4o | ASR 0.99 | 19 |
| Language Modeling | GPT Small (val) | Validation Perplexity 27.95 | 12 |
| AI-generated table detection | GPT 5.2 (External Holdout) | AUROC 88.3 | 12 |
| Adversarial Attack | GPT-4o | ASR 3.8 | 11 |
| Jailbreaking | GPT 5.1 | ASR 96.5 | 9 |
| Detection of paraphrased text | GPT Paraphrased 4.1 | ROC AUC (1% FPR) 0.3977 | 8 |
| Jailbreak | GPT 4.1 8 July 2025 release | ASR 99.8 | 5 |
| Text-to-Video Generation | GPT-G | Semantic Objective 76.8 | 4 |
| Machine-generated text detection | GPT-3.5 (test) | Accuracy 99.14 | 4 |
| Summary Similarity Evaluation | GPT generated summaries 5.1 | BERTScore-F1 86.5 | 3 |
| Text Generation | MiniGPT-4 | BLEU-1 48.1 | 3 |
| AI-generated paper detection | GPT Clean Holdout 5.2 (test) | AUROC 0.8857 | 1 |