| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Image Retrieval | Overall Average | mAP60.6 | 21 | |
| Creative Writing | Overall Average Poem, Joke, Story | Semantic Diversity0.3603 | 20 | |
| Depth Completion | Overall Average (ScanNet, IBims-1, VOID, NYUv2, KITTI, DDAD) | Rank1.75 | 17 | |
| Open-Vocabulary Semantic Segmentation | Overall Average 9 datasets | Average IoU46.9 | 10 | |
| Aggregated Reasoning Evaluation | Overall Average | Average Score @ 1645.6 | 8 | |
| Automatic Subtitling | Overall Average across MSTCIN, ECSC, and EPI (test) | Subtitling Error Rate (AVG)59.2 | 6 | |
| Mathematical Reasoning | Overall Average | Avg Rank1.52 | 5 | |
| fMRI-to-image reconstruction | Overall Average NSD, HCP, BOLD5000, NOD | PixCorr0.104 | 4 |