
ML-BENCH

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
--- | --- | --- | ---
Binary safety classification | ML-BENCH (test) | F1 (Seed Query): 97 | 13