| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| PDE Solving | Dataset C Out-of-distribution 1.0 (test) | Relative L2 Error2.63 | 13 | |
| PDE Solving | Dataset C In-distribution 1.0 (test) | Relative L2 Error0.03 | 13 | |
| Clause-level privacy policy analysis | Dataset C | F1 Score73 | 4 | |
| Reconstruction | Dataset C (held-out) | Mean Reconstruction Score0.193 | 2 | |
| Autonomous rollout classification | Dataset C Lissajous | Mean Train Acc99.98 | 1 |