Claude

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Jailbreak Attack | Claude 3.5 | ASR: 2 | 10 |
| Black-box Adversarial Attack | Claude thinking 4.0 | KMR (a): 0.02 | 9 |
| Jailbreaking | Claude 4.5 | ASR: 97 | 9 |
| AI-generated text detection | Claude-generated (test) | F1 Score: 92.2 | 5 |