
S1

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Distillation data detection | S1 | AUC 95.3 | 63 |
| Training data detection | S1 | TPR@1%FPR 47 | 39 |
| Cluster count selection | S1 | Selected Cluster Count 5 | 21 |
| Clustering | S1 N(150) | Accuracy 100 | 20 |
| Adversarial Attack | S1 finance | ASR 82 | 15 |
| Machine Text Detection | S1 | AUC 91.8 | 15 |
| Hyperspectral Unmixing | S1 (simulated) | Magnetite RMSE 0.0151 | 8 |
| Membership Inference Attack | S1 Distillation Gemini-2.0-flash | AUROC 0.852 | 7 |
| Membership Inference Attack | S1.1 Distillation Deepseek-R1 | AUROC 98.4 | 7 |
| Nuclear Segmentation | S1 (full) | AJI+ 77.3 | 6 |
| Point-level consensus correctness prediction | S1 | AUPRC 99.7 | 4 |
| Eigenfunction recovery | S1 sigma = 0.05 N=500 (test) | SubR2 0.787 | 3 |
| Task performance and governance | S1 HighRisk | CDL 0.5 | 2 |
| Clustering | S1 | SmM 87.28 | 1 |