Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

SCM

Benchmarks

Task NameDataset NameSOTA ResultTrend
Causal Perturbation PredictionSCM dataset linear SCMs (test)
W29.82
10
RegressionSCM20d
Epsilon0.0074
9
PredictionSCM marginal shift
ROC AUC1
9
Binary ClassificationSCM spurious shift (test)
ROC AUC0.733
9
Sample generationscm20d
Standardized Energy Distance10
7
Sample generationscm1d
Standardized energy distance10
7
Multi-Target RegressionSCM20d
Running time (s)79.9444
5
Multi-Target RegressionSCM20d
Model size (MB)275.8278
5
Multi-Target RegressionSCM1d
Model Size (MB)660.5839
5
Multivariate Regression Uncertainty Quantificationscm20d
Coverage (%)99.3
4
Multivariate Regressionscm20d
Coverage90.4
4
Multivariate Uncertainty Quantificationscm20d
Normalized Volume4.25
4
Multivariate Uncertainty Quantificationscm1d
Normalized Volume4.43
4
Multivariate Regressionscm20d
Normalized Volume3.45
4
Long-running Conversational MemorySCM
Answer Accuracy88.4
1
Showing 15 of 15 rows