| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-path Speculative Decoding | Held-out (test) | Average Block Efficiency6.84 | 24 | |
| Bargaining | Held-Out (test) | Reward0.7664 | 16 | |
| Tone Mapping | Held-out (test) | PSNR40.59 | 6 | |
| Clinical case generation | Held-out (test) | BLEU-418.98 | 6 | |
| License Plate Recognition | held-out (test) | Plate Accuracy92.3 | 5 | |
| binary classification | held-out n=2,332 (test) | Accuracy99.61 | 4 | |
| Supply chain disruption forecasting | Held-out (test) | Brier Score0.0791 | 4 |