
Harmlessness on Template T3 GPT-4 evaluation (test)

87.5 Win Rate

SafeDPO

May 26, 2025
Updated 1mo ago

Evaluation Results

Method  Links
2025.05  87.5   10.38  2.12
2025.05  68.75  19.38  11.88
2025.05  58.38  33.25  8.38
2025.05  43.88  45.5   10.62
2025.05  27.62  49.62  22.75