| Task Name | Dataset Name | SOTA Result | Trend | |
|---|---|---|---|---|
| Image Captioning | DID-Bench GT-{GPT4-V} | BLEU-140.3 | 19 | |
| Image Captioning | DID-Bench GT-{LLaVA} | BLEU-142.84 | 19 | |
| Image Captioning | DID-Bench GT-LLaVA (test) | BLEU-139.93 | 15 | |
| Image Captioning | DID-Bench GT-GPT4-V 1.0 (test) | BLEU-136.83 | 15 | |
| Multimodal Evaluation | DID-Bench | CLIP-S Score41.19 | 12 | |
| Image Captioning | DID-Bench | CIDEr3.31 | 4 | |
| Image Captioning | DID-Bench (val) | CIDEr- | 0 | |
| Image Captioning | DID-Bench GT-LLaVA 1.0 (test) | BLEU-1- | 0 |