Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Dialogue Safety Evaluation on Safety Bench Unit Tests
Loading...
30
Safe Rate
Reddit 2.7B
1.088
8.594
16.1
23.606
May 2, 2022
Safe Rate
Realistic Rate
Unsafe Rate
Adversarial Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Safe Rate
Realistic Rate
Unsafe Rate
Adversarial Rate
Reddit 2.7B
Parameter count=2.7B
2022.05
30
26.1
45
43.9
OPT-175B
Parameter count=175B
2022.05
3.3
26.1
56.7
28.3
BlenderBot 1
Training=Fine-tuned on...
2022.05
2.8
15
25
19.4
R2C2 BlenderBot
Training=Fine-tuned on...
2022.05
2.2
13.3
28.9
22.2
Feedback
Search any
task
Search any
task