| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Time-Series Anomaly Detection | TODS | Precision65 | 12 | |
| End-to-End Dialogue Modeling | ToDs (test) | Intent Accuracy95.45 | 11 | |
| Intent Classification | ToDs benchmark GPT-2 backbone (test) | Accuracy0.875 | 11 | |
| Anomaly Detection | TODS univariate | VUS-PR65.2 | 8 |