
Twins

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Treatment Effect Estimation | TWINS | Mean Effect: 0 | 15 |
| Counterfactual Error Estimation | Twins (in-sample) | AUC: 0.87 | 13 |
| Counterfactual error estimation | Twins (out-sample) | AUC: 86.1 | 13 |
| Causal Estimation | Twins Posterior | Mean BSE: 0.0001 | 10 |
| Heterogeneous Treatment Effect Estimation | Twins censoring Real (test) | RMSE (L=30): 2.63 | 9 |
| Heterogeneous Treatment Effect Estimation | Twins no censoring Real (test) | RMSE (L=30): 2.53 | 9 |
| Continuous Treatment Effect Estimation | Twins | RMSE: 0.031 | 6 |
| Average Treatment Effect Estimation | Twins (n=3200) | MAE (eATE): 0.061 | 6 |
| Average Treatment Effect Estimation | Twins (n=1600) | Mean Absolute ATE Error (eATE): 0.061 | 6 |
| Average Treatment Effect Estimation | Twins n=200 | MAE (eATE): 0.065 | 6 |
| Average Treatment Effect Estimation | Twins (n=50) | Mean Absolute ATE Error (eATE): 0.062 | 6 |
| Causal Estimation | Twins Prior | Mean BSE: 0.039 | 5 |
| Individual Treatment Effect estimation | Twins (out-sample) | AUC: 0.861 | 5 |
| Individual Treatment Effect Estimation | Twins (in-sample) | AUC: 0.87 | 5 |
| Computational Efficiency Analysis | Twins | Runtime (s/it): 68.748 | 2 |