| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Classification | SIM (test) | Macro F1-score: 96.2 | 36 |
| Bivariate Causal Discovery | SIM-c | Accuracy: 88 | 33 |
| Bivariate Causal Discovery | SIM | Accuracy: 88 | 33 |
| Bivariate Causal Discovery | SIM | AUROC: 88.3 | 21 |
| Cause-Effect Discovery | SIM-ln | Accuracy: 90 | 16 |
| Cause-Effect Discovery | SIM-c | Accuracy: 85 | 16 |
| Hyperspectral Unmixing | Sim1 | Processing Time: 37.1 | 9 |
| Cause-Effect Discovery | SIM | Accuracy: 80 | 9 |
| Causal Discovery | SIM | AUROC (SIM): 88.3 | 8 |
| Causal Discovery | SIM | Accuracy: 83 | 7 |
| Simulation Problem | Sim 540k | Value Score: 8.54 | 7 |
| Multiple Instance Learning | SIM independent held-out (test) | Accuracy: 86.7 | 4 |
| Dish Wiping | Sim | Success Rate: 58.4 | 3 |
| Fragile Egg Grasping | Sim | Success Rate: 74.8 | 3 |
| In-Hand Box Flipping | Sim | Success Rate: 66 | 3 |
| Image Reconstruction | SIM | PSNR (dB): 27.7 | 2 |
| Causal Discovery | SIM-c | AUDRC: 92 | 1 |
| Causal Discovery | SIM | AUDRC: 90 | 1 |
| ROS 2 Real-Time Scheduling | Sim | Metric: - | 0 |