| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Generic Video Instance Retrieval | This-Is-My personalization P (test) | mAP 56.4 | 5 |
| Contextualized Video Instance Retrieval | This-Is-My test-time personalization dataset P | MRR 4,200 | 5 |
| Visual Question Answering | This-is-my Single Concept (test) | Accuracy 92 | 4 |
| Recognition | This-is-my Single Concept, 1 Reference View 32 | Precision 83.4 | 4 |
| Captioning | This-is-my Multi Concept (test) | Recall 70.9 | 3 |
| Visual Question Answering | This-is-my Multi Concept (test) | Accuracy 72.2 | 3 |
| Recognition | This-is-my Multi Concept 5 Reference Views 32 | Precision 100 | 3 |
| Recognition | This-is-my Single Concept, 5 Reference Views 32 | Precision 90.1 | 3 |
| Recognition | This-is-my Multi Concept 1 Reference View 32 | Precision 100 | 3 |
| Visual Question Answering | This-is-my Video (test) | Accuracy 70 | 2 |