Dialogue Safety Evaluation on Safety Bench Unit Tests
[Chart: Safe Rate over time. Highlighted result: Reddit 2.7B, Safe Rate 30 (May 2, 2022). Selectable metrics: Safe Rate, Realistic Rate, Unsafe Rate, Adversarial Rate.]
Evaluation Results
| Method | Details | Date | Safe Rate | Realistic Rate | Unsafe Rate | Adversarial Rate |
|---|---|---|---|---|---|---|
| Reddit 2.7B | Parameter count=2.7B | 2022.05 | 30 | 26.1 | 45 | 43.9 |
| OPT-175B | Parameter count=175B | 2022.05 | 3.3 | 26.1 | 56.7 | 28.3 |
| BlenderBot 1 | Training=Fine-tuned on... | 2022.05 | 2.8 | 15 | 25 | 19.4 |
| R2C2 BlenderBot | Training=Fine-tuned on... | 2022.05 | 2.2 | 13.3 | 28.9 | 22.2 |