Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA Toxicity Steering benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
Toxicity Steering
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
Eso-LM (512 sequences)
DDPO
Toxicity Score
-9.2
12
1d ago
15 prefix prompts length 50
ILRR
Toxicity Accuracy
71.2
11
3mo ago
MDLM long sequence generation 512 length (test)
ILRR
Steering Accuracy
13.1
6
3mo ago
Showing 3 of 3 rows
25 / page
50 / page
100 / page
1
Search any
task
Search any
task