Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

GPT

Benchmarks

Task NameDataset NameSOTA ResultTrend
JailbreakingGPT-4o
ASR0.99
9
JailbreakingGPT 5.1
ASR96.5
9
Adversarial AttackGPT-4o
CLIP Similarity (RN-50)0.259
9
Detection of paraphrased textGPT Paraphrased 4.1
ROC AUC (1% FPR)0.3977
8
Text-to-Video GenerationGPT-G
Semantic Objective76.8
4
Machine-generated text detectionGPT-3.5 (test)
Accuracy99.14
4
Text GenerationMiniGPT-4
BLEU-148.1
3
Showing 7 of 7 rows