
CHATGPT

Benchmarks

Task Name           Dataset Name    SOTA Result                    Trend
Claim Verification  ChatGPT (test)  Verification Confidence: 82.4  11
Jailbreak Attack    CHATGPT-API     ASR: 98.33                     3