| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| DAG Learning | Synthetic (test) | SID16 | 101 | |
| System Identification | Synthetic dataset | RE1 | 50 | |
| Regression | Synthetic weakly-periodic Interpolation (INT) | Normalized KL Divergence0.01 | 43 | |
| Causal Discovery | Synthetic (n=100, |E|=400, sample size=1000) | mAP99.6 | 36 | |
| Causal Discovery | Synthetic n=1000, |E|=2000, sample size=1000 | mAP96.6 | 32 | |
| Participatory Budgeting Rule Evaluation | Synthetic (test) | Omega'[sat^cost]1 | 30 | |
| Bigram Language Modeling | Synthetic WebText initialization (val) | Avg JS0.001 | 30 | |
| Bigram Language Modeling | Synthetic Random 50% initialization (val) | Avg JS Divergence0.0005 | 30 | |
| Causal Discovery | Synthetic Exponential Noise | ABIC Score30.13 | 30 | |
| Unknown sample identification | Synthetic | AUROC0.928 | 29 | |
| Fair Classification | Synthetic 1.0 (test) | Accuracy72.7 | 28 | |
| Static CT Reconstruction | Synthetic (test) | PSNR32.92 | 24 | |
| Classification | Synthetic (test) | Accuracy86.1 | 22 | |
| Node classification | Synthetic homo ratio 0.9 | Accuracy99.94 | 21 | |
| Node classification | Synthetic homo ratio 0.8 | Accuracy99.89 | 21 | |
| Node classification | Synthetic homo ratio 0.5 | Accuracy92.69 | 21 | |
| Multi-Image Super-Resolution | Synthetic 15 images (test) | PSNR (ME)56 | 18 | |
| Individual Treatment Effect (ITE) Estimation | Synthetic (out) | PEHE3.06 | 16 | |
| Individual Treatment Effect (ITE) Estimation | Synthetic | PEHE2.88 | 16 | |
| Node Classification | Synthetic sigma=1.0 | Mean Accuracy48.27 | 15 | |
| Node Classification | Synthetic sigma=0.6 | Mean Accuracy58.97 | 15 | |
| Participatory Budgeting | Synthetic Euclidean model (test) | Omega'[sat^card]1 | 15 | |
| Binary sequence classification | Synthetic Event-based (irregular) encoding | Accuracy99.93 | 13 | |
| Binary sequence classification | Synthetic Equidistant encoding | Accuracy100 | 13 | |
| Synthetic Regression | Synthetic mixture dx=2 | Normalized Log-Likelihood0.87 | 13 |