| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Arithmetic Reasoning | In-domain (test) | Accuracy53.4 | 50 | |
| Interactive Segmentation | In-domain (test) | IoU86.37 | 14 | |
| Instruction Following | In-domain | Win Rate14 | 11 | |
| Machine Translation | In-Domain (ID) (val) | BLEU40.72 | 10 | |
| Ambisonics encoding | In-Domain | Coherence54 | 7 | |
| AI-Generated Text Detection | In-domain (test) | OA100 | 4 | |
| Shading Estimation | In-domain | MSE0.0265 | 3 | |
| Albedo Estimation | In-domain | MSE0.0051 | 3 | |
| Depth Estimation | In-domain | REL0.1072 | 3 |