| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Personalized Grounding | Yo'LLaVA | Precision100 | 14 | |
| Recognition | Yo'LLaVA | Rec. Single96.2 | 11 | |
| Multiple Choice Question Answering | Yo'LLaVA | Choice-V & T Accuracy (Single)94.2 | 11 | |
| Visual Question Answering | Yo’LLaVA Single Concept (test) | Accuracy97.6 | 4 | |
| Recognition | Yo'LLaVA Single Concept, 1 Reference View 24 | Precision77.2 | 4 | |
| Recognition | Yo'LLaVA Single Concept, 5 Reference Views 24 | Precision85 | 3 | |
| Instance Identification | Yo'LLaVA benchmark | Positive Score94.9 | 3 |