Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Original Dataset

Benchmarks

Task NameDataset NameSOTA ResultTrend
Shelf-life RegressionOriginal Dataset
MSE3.58
10
Spoilage DetectionOriginal Dataset
Spoilage F165
10
Vegetable ClassificationOriginal Dataset
F1 Score98
10
Jailbreak DefenseOriginal Dataset
ASR5.82
8
Multi-class Intent ClassificationOriginal Dataset
10-shot Accuracy86.2
4
Intent ClusteringOriginal Dataset
KM Score84.3
4
Trajectory State EstimationOriginal Dataset v1 (Short)
Center of Mass Error0.0085
3
Trajectory State EstimationOriginal Dataset Long v1
Center of Mass Error0.0284
3
Ovarian Cancer DetectionOriginal Dataset
Accuracy77.78
3
Showing 9 of 9 rows