Language Model Detoxification on Human Evaluation 50 generations (test)
[Chart: Detoxification Count. Top entry: LM-Steer, 0.49 (May 22, 2023). Available metrics: Detoxification Count, Fluency Count, Topical Relevance Count.]
Updated 4d ago
Evaluation Results
| Method | Setting | Date | Detoxification Count | Fluency Count | Topical Relevance Count |
|---|---|---|---|---|---|
| LM-Steer | baseline_comparison=GPT-2 | 2023.05 | 0.49 | 0.42 | 0.64 |
| LM-Steer | baseline_comparison=DE... | 2023.05 | 0.48 | 0.5 | 0.64 |
| DExperts | baseline_comparison=LM... | 2023.05 | 0.39 | 0.46 | 0.23 |
| LM-Steer | baseline_comparison=LoRA | 2023.05 | 0.38 | 0.42 | 0.36 |
| GPT-2 | baseline_comparison=LM... | 2023.05 | 0.38 | 0.43 | 0.42 |
| LoRA | baseline_comparison=LM... | 2023.05 | 0.23 | 0.2 | 0.25 |