| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Personalized Grounding | MyVLM | Precision: 100 | 14 |
| Recognition | MyVLM | Single Recall: 98.4 | 11 |
| Image Captioning | MyVLM | Caption Recall (Single): 0.975 | 11 |
| Captioning | MyVLM Single Concept (test) | Recall: 91.3 | 4 |
| Recognition | MyVLM Single Concept, 1 Reference View | Precision: 86 | 4 |
| instance-aware caption generation | MyVLM benchmark | CLIP Image Similarity: 27.06 | 4 |
| Recognition | MyVLM Single Concept, 5 Reference Views 1 | Precision: 87.7 | 3 |
| Instance Identification | MyVLM benchmark | Positive Score: 97 | 3 |