| Task Name | Dataset Name | Metric | SOTA Result | Trend |
|---|---|---|---|---|
| Image Reasoning | ZeroBench sub | Accuracy | 24.4 | 14 |
| Multimodal reasoning | ZeroBench | Accuracy | 26.35 | 14 |
| Multimodal reasoning | ZeroBench main | Pass@1 | 11 | 13 |
| Math | ZeroBench | Score | 17.66 | 8 |
| General Reasoning & Understanding | ZeroBench | Accuracy | 18.9 | 8 |
| Multimodal reasoning | ZeroBench sub | Pass@1 | 30.8 | 7 |