| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Classification | Red Wine | Recall 74.8 | 19 |
| Classification | Red Wine | Precision 75 | 18 |
| Classification | Red Wine | Accuracy 75.3 | 13 |
| Imbalanced Classification | Red Wine (test) | Macro F1 35.06 | 12 |
| task generation | red-wine benchmark | Utility 61.84 | 9 |
| Faithfulness under retraining | Red wine | AURC 0.355 | 5 |
| Regression | Red Wine UCI (test) | ECP-95 0.95 | 4 |
| Instance attribution explanation | Red wine (test) | Wall-clock Time (s) 0.05 | 4 |
| Target sensitivity estimation | Red wine (test) | Pearson r 1 | 3 |
| Multi-Class Formal Verification | Red-Wine | PAR2 Runtime 3.83 | 2 |