Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

D1

Benchmarks

Task NameDataset NameSOTA ResultTrend
ClassificationD1
Mean Accuracy99.291
30
Time Series ForecastingSynthetic D1 (test)
MSE0.512
16
Medical Image SegmentationD1 Pediatric Organs in CT
DSC85.59
14
RegressionD1
Average Relative MSE0.038
10
ClassificationD1 0.15 (test)
Mean Accuracy99.742
10
Outlier DetectionD1 with only clusteriers (test)
AUC83.3
9
Aspect-level sentiment classificationD1
Accuracy79.11
9
SegmentationD1 trained on D0 (test)
Dice93.67
7
SegmentationD1 evaluated after training on D2
Dice Score93.67
7
Root Cause LocalizationD1 (complete data conditions)
Top-1 Score82.1
7
Missing Data ImputationD1 (test)
Micro AUPRC81.64
7
Hospital readmission predictionD1 (test)
Mean AUPRC21.51
7
Failure TriageD1 complete data conditions
Precision94.6
6
Anomaly DetectionD1 complete data conditions
Precision92.5
6
Time-Domain PredictionD1
NMSE (dB)-19.5
6
Reliability AssessmentD1 (test)
AU-ARC91.96
5
Frequency-Domain PredictionD1
NMSE (dB)-18.43
5
People countingD1 seen environment (70%-30%)
Average Precision (AP)0.838
4
CSI ReconstructionD1
NMSE (dB)-19.26
3
Root Cause LocalizationD1 (test)
Execution Time (s)21.52
2
Failure TriageD1 (test)
Execution Time (s)1.56
2
Anomaly DetectionD1 (test)
Execution Time (s)5.23
2
Document Question AnsweringD1
Correct Answers20
2
Visual Question AnsweringD1
Effective Answer Rate (C+P)50
2
Page LocalizationD1
Page Localization Success Rate1
1
Showing 25 of 27 rows