Open-domain dialogue red teaming on Bloom ZS (filtered) (test)
[Chart: RSR by date; metrics shown: RSR, Self-BLEU. Top result: BRT (e+r), RSR 16.3, May 27, 2023.]
Evaluation Results
| Method | Details | Date | RSR | Self-BLEU |
| --- | --- | --- | --- | --- |
| BRT (e+r) | Target model=GODEL-lar... | 2023.05 | 16.3 | 50.4 |
| SL | Surrogate model=OPT-1.... | 2023.05 | 7.8 | 60.4 |
| BRT (s+r) | Target model=GODEL-lar... | 2023.05 | 5 | 50.4 |
| Offensive Top-NQ | Target model=GODEL-lar... | 2023.05 | 4.7 | 50.9 |
| SFS | Surrogate model=OPT-1.... | 2023.05 | 3.3 | 51.4 |
| SFS | Surrogate model=Bloom,... | 2023.05 | 2.6 | 52.3 |
| Rand | Target model=GODEL-lar... | 2023.05 | 1.3 | 53.6 |