Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Understanding on MMT emo
Loading...
60.8
Accuracy
CoMemo
50.4
53.1
55.8
58.5
May 1, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
CoMemo
Training Strategy=Visu...
2026.05
60.8
PVM-8B (SFT)
Backbone=8B, Training...
2026.05
58.3
PVM-8B (SFT + GRPO)
Backbone=8B, Training...
2026.05
58.3
PVM-4B (SFT)
Backbone=4B, Training...
2026.05
57.5
Qwen3-VL-8B-Instruct
Backbone=8B
2026.05
56.7
Qwen3-VL-4B-Instruct
Backbone=4B
2026.05
56.7
Qwen3-VL-4B (LoRA-SFT + GRPO)
Backbone=4B, Training...
2026.05
55.8
PVM-4B (SFT + GRPO)
Backbone=4B, Training...
2026.05
55.8
PEARL-8B
Backbone=8B, Training...
2026.05
55
Qwen3-VL-4B (LoRA-SFT)
Backbone=4B, Training...
2026.05
55
MemVR
Training Strategy=Visu...
2026.05
54.2
ICoT
Training Strategy=Visu...
2026.05
54.2
Euclid-8B
Backbone=8B, Training...
2026.05
54.2
Qwen3-VL-8B (SFT + GRPO)
Backbone=8B, Training...
2026.05
54.2
Qwen3-VL-4B (SFT + GRPO)
Backbone=4B, Training...
2026.05
53.3
Qwen3-VL-8B (LoRA-SFT + GRPO)
Backbone=8B, Training...
2026.05
52.5
OneThinker-8B
Backbone=8B, Training...
2026.05
51.7
Qwen3-VL-8B (LoRA-SFT)
Backbone=8B, Training...
2026.05
51.7
Qwen3-VL-8B (SFT)
Backbone=8B, Training...
2026.05
50.8
Qwen3-VL-4B (SFT)
Backbone=4B, Training...
2026.05
50.8
Feedback
Search any
task
Search any
task