| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Multimodal Evaluation | MM-Bench | Accuracy83 | 57 | |
| Multimodal Understanding | MM-Bench en (test) | Accuracy83.9 | 27 | |
| Multimodal Understanding | MM-Bench cn (test) | Accuracy79.2 | 19 | |
| Multimodal Benchmarking | MM-Bench 37 | Accuracy71.5 | 19 |