| Dataset Name | SOTA Method | Metric | Trend | ||
|---|---|---|---|---|---|
| LLaVA-Bench In-the-Wild | GPT-4V | Score93.1 | 23 | 1mo ago | |
| ViSiT-Bench Sept. 27th, 2023 (Leaderboard) | ELO1,382 | 15 | 1mo ago | ||
| MIA-Bench | Score8.86 | 12 | 1mo ago | ||
| CoIN | ϕ-DPO | SciQA Score77.84 | 9 | 18d ago | |
| LLaVA Wilder | VLSI-7B | Score92 | 9 | 1mo ago | |
| TouchStone | Qwen-VL-Chat | English Metric645.2 | 7 | 1mo ago | |
| LLaVA-Bench In-the-Wild 1.0 (test) | LLaVA | Conversation58.8 | 4 | 1mo ago |