Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Titanic

Benchmarks

Task NameDataset NameSOTA ResultTrend
ClassificationTitanic
Accuracy86
31
RegressionTitanic
Standard Deviation0
20
ClassificationTitanic
AUC0.8736
16
Counterfactual Explanationstitanic
Validity44.7
14
Classificationtitanic (val)
AUROC0.883
12
ClassificationTitanic (test)
Accuracy80.5
9
ClassificationTitanic (three random train-test splits)
Accuracy81.6
8
Classificationtitanic 25% feature noise
Accuracy78.9
7
Classificationtitanic
F1-score85.2
7
Classificationtitanic 0% noise (test)
Accuracy79.1
7
Classificationtitanic (0% noise)
F1 Score85.6
7
Classificationtitanic 25% feature noise
F1-score86.2
7
Classificationtitanic 25% label noise
F1 Score85.9
7
RegressionTitanic
Average Relative Absolute Error38
6
Faithfulness under retrainingTitanic
AURC13.988
5
Binary ClassificationTitanic (test)
Macro F1-score77.6
5
domain-specific question answeringtitanic
Accuracy75.51
5
ClassificationTitanic (out-of-sample)
Median AUC0.8736
5
Instance attribution explanationTitanic (test)
Wall-clock Time (seconds)0.03
4
ClassificationTitanic (OOD)
Accuracy19
3
ClassificationTitanic (noisy)
Accuracy72.032
3
ClassificationTitanic (train)
Accuracy89.6
3
Target sensitivity estimationTitanic (test)
Pearson r1
3
Binary ClassificationTITANIC
R50013
3
p-robustness EstimationTITANIC
R50044
3
Showing 25 of 28 rows