Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Hard

Benchmarks

Task NameDataset NameSOTA ResultTrend
Jailbreak DefenseHard (H)
FPR0
12
ClassificationHARD (test)
Accuracy97.77
8
Online LearningHARD
Latency (s)0.2516
8
Online Bin PackingHard28-R
Gap Percentage8.06
4
Showing 4 of 4 rows