Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Red Teaming against BB-3B on Empathetic Dialogues
Loading...
41.3
RSR
BRT (e)
1.26
11.655
22.05
32.445
May 27, 2023
RSR
Self-BLEU (k)
Updated 4d ago
Evaluation Results
Method
Method
Links
RSR
Self-BLEU (k)
BRT (e)
Query limit (NQ)=20,000
2023.05
41.3
35.6
BRT (e+r)
Query limit (NQ)=20,000
2023.05
40.2
35.2
SFS
Backbone=OPT-1.3B, Que...
2023.05
13.9
40.1
SL
Backbone=OPT-1.3B, Que...
2023.05
13.7
48.3
SFS
Backbone=Bloom, Query...
2023.05
11.3
42.3
BRT (s+r)
Query limit (NQ)=20,000
2023.05
7.2
37.1
BRT (s)
Query limit (NQ)=20,000
2023.05
7
37.7
Offensive Top-NQ
Query limit (NQ)=20,000
2023.05
6.5
37.6
Rand
Query limit (NQ)=20,000
2023.05
2.8
38.4
Feedback
Search any
task
Search any
task