DALL-E

Benchmarks

| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Membership Inference Attack | DALL·E (test) | TPR @ 5% FPR | 29.4 | 54 |
| Image Copy Detection | DALL-E 2 (test) | Average Similarity | 0.826 | 28 |
| Membership Inference Attack | DALL-E | Accuracy | 72.3 | 26 |
| Generated Image Detection | DALL-E 3 | AP | 98.1 | 15 |
| Synthetic Image Detection | DALL-E 2 (full) | Acc@EER | 77.3 | 12 |
| Fake image detection | DALL·E | AA | 98.81 | 9 |
| Synthetic Image Detection | DALL-E 2 (Academic) | AUCROC | 0.727 | 9 |
| Synthetic Image Detection | DALL-E 3 (proprietary) | AUCROC | 0.759 | 9 |
| Synthetic Image Detection | DALL-E 2 (proprietary) | AUCROC | 85.4 | 9 |
| Synthetic image detection | DALL-E | Accuracy | 98.8 | 9 |
| Nudity Jailbreaking Transfer | DALL·E Universal nudity jailbreaking prompts 3 | Transfer Success Rate | 35.58 | 7 |
| Image Watermarking | DALL-E 3 | PSNR | 18.28 | 7 |
| Watermark Robustness | DALL-E 3 | Robustness: Brightness | 100 | 7 |
| Adversarial Attack | DALL·E 3 commercial (test) | BR | 0.95 | 7 |
| Jailbreak Attack | DALL·E 3 | TASR (%) | 12.98 | 6 |
| Synthetic Image Detection | DALL-E 3 (full) | Acc@EER | 69.8 | 6 |
| Synthetic Image Detection | DALL-E Anime 2 | AP | 62 | 6 |
| Synthetic Image Detection | DALL-E 3 | AP | 0.756 | 6 |
| Synthetic Image Detection | DALL-E 2 | AP | 86.7 | 6 |
| Deepfake Detection | DALL-E mini | AP | 99.33 | 4 |
| Deepfake Detection | DALL-E3 | Accuracy | 82 | 3 |
| Text-to-Image Jailbreaking | DALL·E 3 with ChatGPT 5 | ASR | 48 | 3 |
| Image Adherence | DALL-E 3 Prompt Set | Adherence Score (%) | 84.4 | 2 |
| Image Quality | DALL-E 3 Prompt Set | Score | 90 | 2 |