Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Credit

Benchmarks

Task NameDataset NameSOTA ResultTrend
ClassificationCredit
ROCAUC98.6
63
Numerical ReasoningCredit
Hit Rate @ 174.5
24
Contamination detectioncredit
Acomp0.81
24
Faithful Narrative Generationcredit
RA100
16
Counterfactual ExplanationsCredit (test)
IM11.0767
16
Fairness ClassificationCredit (test)
Disparate Impact (DP)0.1762
14
Fairness EvaluationCredit (test)
PP5.66
14
Classificationcredit (test)
EOpp0
14
Node ClassificationCredit Dataset
BACC69.95
14
Data ImputationCredit-g
Accuracy54.46
13
Graph Multiple Sensitive Attribute Inference AttackCredit
AA (Attribute Accuracy)74.71
10
Counterfactual GenerationCredit
Runtime (minutes)0
9
Classificationcredit (UCI)
Accuracy83.1
9
Node ClassificationCredit r: 0.01 (test)
ACC (%)78.13
9
Classificationcredit small
Accuracy99.39
9
ClassificationCredit
Error Rate24.79
9
Actionable Counterfactual GenerationCredit 1994 (test)
Validity100
9
Tabular ClassificationCredit
Clean Accuracy83.3
9
Multi-class classificationCredit
Macro F1 Score0.48
9
ClassificationCredit
Individual Fairness Gap12.8
8
Strategic ClassificationCredit
Post-manipulation Accuracy83.96
8
Node ClassificationCredit (test)
Accuracy73.79
8
Full black box attackCredit (test)
FBB ROC AUC0.71
8
Explanation GenerationCredit
PPL4.3
7
ClassificationCredit age (test)
F1 Score83.47
7
Showing 25 of 53 rows