| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multi-task Language Modeling | MergeBench | Instruction Score39.56 | 11 | |
| Vision-Language Multi-task Performance | MergeBench (Vision-Language tasks: MMSI-Bench, EmbSpatial, MMMU_Med, PathVQA, OCRBench, CharXiv) | MMSI-Bench32.6 | 11 |