Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Spambase

Benchmarks

Task NameDataset NameSOTA ResultTrend
LF Mislabeling IdentificationSpambase
AP92.7
32
End model evaluationSpambase
Test Loss0.258
22
Outlier DetectionSpamBase ADBench
AUROC66.2
17
Classificationspambase
Accuracy93.5
15
Outlier DetectionSpamBase (Group I)
AUROC66.2
14
ClassificationSpambase (test)
Test Loss0.283
13
Outlier DetectionSpamBase
AUC0.9021
11
Outlier DetectionSpamBase
AP86.31
11
Anomaly DetectionSpamBase
AUPRC89.24
10
Anomaly DetectionSpamBase Out-of-Domain
F1 Score81.8
10
Spam ClassificationSpambase 0% missingness (test)
AUC98.73
10
Hierarchical ClusteringSpambase
Dasgupta's Cost34,261,369.825
10
Hierarchical ClusteringSpambase
DP75.5
10
ClassificationSpambase
F1 Score93.92
9
CASHspambase (test)
Test Error0.0591
9
Private Decision Tree Evaluationspambase
Online Running Time31.2
8
ClassificationSpambase (5-fold cross-val)
Accuracy92.23
7
Abductive Explanation GenerationSpambase (test)
Average Execution Time (ms)2.92
6
Mixture Proportion EstimationUCI spambase (test)
Absolute Error0.006
6
PvN classificationUCI Spambase (test)
Accuracy89.4
6
Active Learning ClassificationSpambase
F1 Score78.5
5
Decision Tree Evaluationspambase
Overall Latency (s)19,700
4
Binary ClassificationSpambase (test)
Macro F1 Score91.7
4
Abductive Explanation GenerationSpambase Rejected
Avg Explanation Size48.89
3
Abductive Explanation GenerationSpambase
Avg Explanation Size1.24
3
Showing 25 of 31 rows