
Titanic

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Classification | Titanic | Accuracy 78.5 | 28 |
| Regression | Titanic | Standard Deviation 0 | 20 |
| Classification | titanic (val) | AUROC 0.883 | 12 |
| Classification | Titanic (test) | Accuracy 80.5 | 9 |
| Classification | Titanic | AUC 0.8736 | 8 |
| Classification | titanic 25% feature noise | Accuracy 78.9 | 7 |
| Classification | titanic | F1-score 85.2 | 7 |
| Classification | titanic 0% noise (test) | Accuracy 79.1 | 7 |
| Classification | titanic (0% noise) | F1 Score 85.6 | 7 |
| Classification | titanic 25% feature noise | F1-score 86.2 | 7 |
| Classification | titanic 25% label noise | F1 Score 85.9 | 7 |
| Regression | Titanic | Average Relative Absolute Error 38 | 6 |
| Faithfulness under retraining | Titanic | AURC 13.988 | 5 |
| Binary Classification | Titanic (test) | Macro F1-score 77.6 | 5 |
| domain-specific question answering | titanic | Accuracy 75.51 | 5 |
| Classification | Titanic (out-of-sample) | Median AUC 0.8736 | 5 |
| Instance attribution explanation | Titanic (test) | Wall-clock Time (seconds) 0.03 | 4 |
| Target sensitivity estimation | Titanic (test) | Pearson r 1 | 3 |
| Binary Classification | TITANIC | R50013 | 3 |
| p-robustness Estimation | TITANIC | R50044 | 3 |
| Lexical Coverage Analysis | Titanic | Coverage 95 | 2 |
| Data Contamination Detection | Titanic | Metric - | 0 |
| Fairness-aware classification | titanic (test) | Metric - | 0 |