Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language Model Detoxification on Human Evaluation 50 generations (test)

0.49Detoxification Count

LM-Steer

0.21960.28980.360.4302May 22, 2023
Updated 4d ago

Evaluation Results

MethodLinks
2023.05
0.490.420.64
2023.05
0.480.50.64
2023.05
0.390.460.23
2023.05
0.380.420.36
2023.05
0.380.430.42
2023.05
0.230.20.25