| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Anti-spoofing | Pooled | EER4.69 | 10 | |
| Mathematical Reasoning | Pooled 5-benchmark set | Accuracy54.44 | 6 | |
| Code Editing Copy-as-Decode Efficiency Analysis | Pooled (All) | Number of Cases482 | 1 | |
| Competing Risks Survival Analysis | Pooled 4 datasets (10 splits x 2 events) | Metric- | 0 |