Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Gemini

Benchmarks

Task NameDataset NameSOTA ResultTrend
Black-box Adversarial AttackGemini 2.5-Pro
KMRa0.87
9
JailbreakingGemini Pro 3
ASR92.5
9
Adversarial AttackGemini 2.0
CLIP Similarity (RN-50)0.2617
9
Policy Corruption EvaluationGemini-2-Flash
Compliance3.65
5
Targeted Adversarial AttackGemini 2.0
ASR520
4
Showing 5 of 5 rows