| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Affine-invariant depth estimation | DA-2K | Accuracy97.1 | 16 | |
| Certainty Equivalent Estimation | DA non-mixed prospects 1992 | Median Certainty Equivalent (p=0.1)59 | 13 | |
| Entity Matching | DA (test) | F1 Score98.4 | 13 | |
| Dialogue Analysis | DA | R Metric92.5 | 10 | |
| 3D Spatial understanding | DA-2K | Pass@191.9 | 7 |