| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Out-of-Distribution Detection | CLINCSMALL | FPR0.1208 | 34 | |
| Out-of-Distribution Detection | CLINCFULL | FPR11.24 | 34 | |
| Out-of-Distribution Detection | CLINC OOD (train) | EER6.11 | 27 | |
| Intent Classification | Clinc150 (test) | Accuracy96.35 | 26 | |
| Out-of-Distribution Detection | CLINC Full (test) | AUROC97.24 | 21 | |
| Out-of-Distribution Detection | CLINC SMALL (test) | AUROC96.85 | 17 | |
| Intent Detection | CLINC 10-shot (test) | Accuracy94.84 | 16 | |
| Out-of-Distribution Detection | CLINC Movie IND OOD | EER3.7 | 16 | |
| Clustering | CLINC | Accuracy91 | 15 | |
| Intent Clustering | CLINC full 2019 | NMI93.89 | 13 | |
| Out-of-scope intent detection | CLINC-CreditCards OOS (test) | AUC (10% Threshold)86.4 | 12 | |
| Intent Detection | CLINC 5-shot (test) | Accuracy92.62 | 12 | |
| Intent Detection | CLINC Full (test) | Accuracy97.31 | 11 | |
| Intent Clustering | CLINC 1.0 (test) | K Predicted195 | 9 | |
| Intent Clustering | CLINC (I) | NMI94.86 | 6 | |
| Open intent recognition | CLINC | Accuracy93.42 | 6 | |
| Uncertainty Calibration | Clinc150 (test) | ECE0.021 | 4 | |
| Short-text Clustering | Clinc150 | NMI- | 0 |