| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Real-world Understanding | WildVision | Win Rate 80.6 | 17 |
| Human Preferences | WildVision 0617 | Score 89.4 | 14 |
| Pointwise Scoring | WildVision (pointwise) | Kendall's Tau 0.949 | 9 |
| Multi-modal preference alignment | WildVision | Winning Rate 40.2 | 6 |
| Multi-modal Chat | WildVision 0617 (test) | General Score 89.2 | 4 |