Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Jailbreaking on SafeBench evaluated on OpenAI-o1
Loading...
34.8
FS
o1
33.06
33.93
34.8
35.67
Nov 30, 2024
FS
QR
MML-WR
MML-R
Updated 4d ago
Evaluation Results
Method
Method
Links
FS
QR
MML-WR
MML-R
o1
System Prompt=none
2024.11
34.8
15.6
64.4
49.4
Feedback
Search any
task
Search any
task