Toxicity mitigation evaluation on Specialized category
[Chart: Optimus, RTR 59.6 (Jul 8, 2025). Metrics: RTR, PPL, FBD, GRD.]
Updated 16d ago
Evaluation Results
| Method | Configuration | Date | RTR | PPL | FBD | GRD |
|---|---|---|---|---|---|---|
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 59.6 | 5.83 | 0.097 | 60.1 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 13.1 | 5.46 | 0.097 | 59.6 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 9.7 | 6 | 0.099 | 61 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 8.2 | 6.19 | 0.1 | 60.8 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 7.7 | 6 | 0.1 | 61.5 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 6.8 | 6.2 | 0.101 | 60.9 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 3.2 | 6.11 | 0.102 | 60.4 |
| Optimus | Backbone=LLaMA-2, Defe... | 2025.07 | 3 | 6.27 | 0.103 | 60.6 |