Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Safety Alignment Evaluation on Safety Evaluation Dataset

81Response Safe Rate (Llama Guard Model)

DCR

64.3668.687377.32Feb 10, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.02
8192
2026.02
7887
2026.02
7788
2026.02
7286
2026.02
6580