| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Safety Detection | chat 1m (test) | MCA Accuracy100 | 21 | |
| Chat | Chat | Chat Score49.3 | 8 | |
| Sleep staging | CHAT | AUC98.4 | 7 | |
| Safety Classification | Chat 1m-Conv | MCA99 | 6 | |
| Safety Classification | Chat 1m | MCA100 | 6 | |
| Text Summarization | Chat (test) | ROUGE-128.23 | 6 | |
| Computational cost analysis | chat 1m | Inference Latency (per prompt)0.01 | 5 | |
| Sleep Staging | chat in-distribution (test) | Macro F1 (Mean)86 | 4 | |
| Sleep Stage Classification | CHAT | Macro F186 | 2 |