Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Alpaca + XSTest

Benchmarks

Task NameDataset NameSOTA ResultTrend
Safety DetectionAlpaca + XSTest full (val-selected)
AUROC0.991
42
Refusal DetectionAlpaca + XSTest (val)
AUROC0.988
42
Showing 2 of 2 rows