| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Personalized Federated Learning | Task 6 | Accuracy 85.3 | 13 |
| Personalized Federated Learning | Task 2 | Accuracy 81.3 | 13 |
| Personalized Federated Learning | Task 1 | Accuracy 42.4 | 13 |
| Disease Prediction | Task 4 | AUROC 65 | 10 |
| Disease Prediction | Task 7 | AUROC 0.76 | 6 |
| Disease Prediction | Task 6 | AUROC 0.84 | 6 |
| Deployment Performance | Task 6 | TTFT (s) 1.227 | 4 |
| Deployment Performance | Task 3 | TTFT (s) 2.333 | 4 |
| Deployment Performance | Task 2 | TTFT (s) 1.376 | 4 |
| Few-shot example selection | Task #12 | Score 70 | 4 |
| Few-shot example selection | Task #11 | Score 0.87 | 4 |
| Few-shot example selection | Task #10 | Score 0.95 | 4 |
| Few-shot example selection | Task 9 | Score 0.65 | 4 |
| Few-shot example selection | Task #8 | Score 0.42 | 4 |
| Few-shot example selection | Task #6 | Score 84 | 4 |
| Few-shot example selection | Task #5 | Score 84 | 4 |
| Few-shot example selection | Task #4 | Score 97 | 4 |
| Few-shot example selection | Task #3 | Score 0.93 | 4 |
| Few-shot example selection | Task #2 | Score 27 | 4 |
| Few-shot example selection | Task #1 | Score 67 | 4 |
| Disease Prediction | Task 8 | AUROC 72 | 4 |
| Incomplete Utterance Rewriting | TASK | EM 0.692 | 4 |
| Observe | Task Observe VIII | Metric - | 0 |
| Lifting to Investigate | Task VI (Lifting to Investigate) | Metric - | 0 |
| Grasping | Task IV Grasping | Metric - | 0 |