Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PKU-Safety

Benchmarks

Task NameDataset NameSOTA ResultTrend
Human Preference AlignmentPKU-Safety
Win Rate67.1
3
LLM AlignmentPKU-Safety (test)
Win Rate58
2
Showing 2 of 2 rows