| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Shelf-life Regression | Original Dataset | MSE3.58 | 10 | |
| Spoilage Detection | Original Dataset | Spoilage F165 | 10 | |
| Vegetable Classification | Original Dataset | F1 Score98 | 10 | |
| Jailbreak Defense | Original Dataset | ASR5.82 | 8 | |
| Multi-class Intent Classification | Original Dataset | 10-shot Accuracy86.2 | 4 | |
| Intent Clustering | Original Dataset | KM Score84.3 | 4 | |
| Trajectory State Estimation | Original Dataset v1 (Short) | Center of Mass Error0.0085 | 3 | |
| Trajectory State Estimation | Original Dataset Long v1 | Center of Mass Error0.0284 | 3 | |
| Ovarian Cancer Detection | Original Dataset | Accuracy77.78 | 3 |