| Dataset Name | SOTA Method | Metric | Value | Count | Updated |
|---|---|---|---|---|---|
| MMQA | | Accuracy | 70.5 | 36 | 3d ago |
| ScienceQA | CASHEW | Accuracy | 97.8 | 35 | 2d ago |
| MM-Vet | Qwen3-VL-4B | Total Score | 68.3 | 24 | 3d ago |
| ScienceQA v1.3 (test) | | NAT Score | 0.9019 | 21 | 3d ago |
| SEED-Bench | QMoSLoRA | Accuracy (All) | 71.1 | 21 | 3d ago |
| Recap-COCO | TCAP (Ours) | CP | 65.94 | 15 | 3d ago |
| MULTIMODALQADoc | FIF | EM | 65.15 | 12 | 3d ago |
| ScienceQA-IMG zero-shot | InstructBLIP | Accuracy | 70.4 | 12 | 3d ago |
| ScienceQA (SQA) | HSR-VATA | Avg Accepted Length | 3.86 | 10 | 3d ago |
| MMBench CN | MergeMix | Accuracy | 81.18 | 10 | 3d ago |
| MultiModalQA (val) | HPROPRO | EM | 65.1 | 10 | 3d ago |
| MMBench CN (test) | | Accuracy | 88.9 | 9 | 3d ago |
| MMBench en (test) | | Accuracy | 89 | 9 | 3d ago |
| CCBench | Qwen-VL-Chat | Score | 41.2 | 9 | 3d ago |
| MMBench (test) | LLaVA-v1.5 | Score | 64.3 | 9 | 3d ago |
| MMCOQADoc | FIF | EM | 51.11 | 6 | 3d ago |
| MMBench EN | QMoSLoRA | Accuracy | 73.8 | 6 | 3d ago |
| ScienceQA (SQA) (test) | | SQA Accuracy | 70.2 | 4 | 3d ago |
| ScienceQA (test) | ViT-L/14-224 | Accuracy | 91.2 | 3 | 3d ago |