| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Visual Instruction Following | Visual Instruction Total (test) | Avg. Response Length (Words): 153.04 | 6 |
| Visual Instruction Following | Visual Instruction Out-Of-Distribution - Hard (test) | Win Ratio (Human): 88 | 5 |
| Visual Instruction Following | Visual Instruction Out-Of-Distribution - Simple (test) | Human Win Ratio: 93 | 5 |
| Visual Instruction Following | Visual Instruction Out-Of-Distribution (test) | Human Win Ratio: 90 | 5 |
| Visual Instruction Following | Visual Instruction In-Distribution - Hard (test) | Human Win Ratio: 89 | 5 |
| Visual Instruction Following | Visual Instruction In-Distribution - Simple (test) | Human Win Ratio: 89 | 5 |
| Visual Instruction Following | Visual Instruction In-Distribution (test) | Human Win Ratio: 89 | 5 |