Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Twins

Benchmarks

Task NameDataset NameSOTA ResultTrend
Conditional Average Treatment Effect EstimationTwins RCT
Multiplicative Calibration Error1.372
15
Uplift ModelingTwins
Qini Coefficient0.092
15
Treatment Effect EstimationTWINS
Mean Effect0
15
Counterfactual Error EstimationTwins (in-sample)
AUC0.87
13
Counterfactual error estimationTwins (out-sample)
AUC86.1
13
Causal EstimationTwins Posterior
Mean BSE0.0001
10
Heterogeneous Treatment Effect EstimationTwins censoring Real (test)
RMSE (L=30)2.63
9
Heterogeneous Treatment Effect EstimationTwins no censoring Real (test)
RMSE (L=30)2.53
9
Counterfactual Distribution EstimationTWINS HeavyTails outcome (synthetic)
Average W1 Error0.25
6
Continuous Treatment Effect EstimationTwins
RMSE0.031
6
Average Treatment Effect EstimationTwins (n=3200)
MAE (eATE)0.061
6
Average Treatment Effect EstimationTwins (n=1600)
Mean Absolute ATE Error (eATE)0.061
6
Average Treatment Effect EstimationTwins n=200
MAE (eATE)0.065
6
Average Treatment Effect EstimationTwins (n=50)
Mean Absolute ATE Error (eATE)0.062
6
Causal EstimationTwins Prior
Mean BSE0.039
5
Individual Treatment Effect estimationTwins (out-sample)
AUC0.861
5
Individual Treatment Effect EstimationTwins (in-sample)
AUC0.87
5
Counterfactual PredictionTwins RealCause (test)
Empirical Coverage99.8
3
Computational Efficiency AnalysisTwins
Runtime (s/it)68.748
2
Treatment Effect EstimationTwins
Coverage Mean97.12
1
Showing 20 of 20 rows