Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Red Teaming on DailyDialog against DialoGPT-large
Loading...
40
RSR
BRT (e+r)
0.376
10.663
20.95
31.237
May 27, 2023
RSR
Self-BLEU (k)
Updated 4d ago
Evaluation Results
Method
Method
Links
RSR
Self-BLEU (k)
BRT (e+r)
Query Limit (NQ)=20,000
2023.05
40
24.9
BRT (e)
Query Limit (NQ)=20,000
2023.05
37.1
34.5
SL
Query Limit (NQ)=20,00...
2023.05
13.1
49.4
SFS
Query Limit (NQ)=20,00...
2023.05
11.7
43.6
BRT (s+r)
Query Limit (NQ)=20,000
2023.05
5.4
37.7
Offensive Top-NQ
Query Limit (NQ)=20,000
2023.05
5.3
38.1
BRT (s)
Query Limit (NQ)=20,000
2023.05
4.9
38.5
Rand
Query Limit (NQ)=20,000
2023.05
1.9
38.8
Feedback
Search any
task
Search any
task