Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Binary safety classification on ML-BENCH (test)

97F1 (Seed Query)

ML-GUARD-7B

-1.823.8549.575.15May 1, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.05
979061999744619092
2026.05
858461947244270044
2026.05
857035835721135050
723433682124222177
2026.05
62231759139160018
2026.05
618644436002
2026.05
596440325001
2026.05
5798485412206
2026.05
4961033353103
2026.05
48474546474546454545
2026.05
463243621220101120
2026.05
4527530163411318
2021010000