Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Multimodal Evaluation on Macro-average of HallusionBench, AMBER, CRPE, R-Bench, and BLINK
Loading...
63.35
Overall Score
IC-VCO
58.9716
60.1083
61.245
62.3817
May 29, 2026
Overall Score
Updated 2d ago
Evaluation Results
Method
Method
Links
Overall Score
IC-VCO
Contrastive Sample Sou...
2026.05
63.35
IC-VCO
Contrastive Sample Sou...
2026.05
62.83
SymMPO
Contrastive Sample Sou...
2026.05
62.11
mDPO
Contrastive Sample Sou...
2026.05
62.02
mDPO
Contrastive Sample Sou...
2026.05
61.64
SymMPO
Contrastive Sample Sou...
2026.05
61.5
S-VCO
Contrastive Sample Sou...
2026.05
61.41
S-VCO
Contrastive Sample Sou...
2026.05
60.81
DPO
Contrastive Sample Sou...
2026.05
60.4
V-DPO
Contrastive Sample Sou...
2026.05
60.38
DPO
Contrastive Sample Sou...
2026.05
60.32
V-DPO
Contrastive Sample Sou...
2026.05
60.15
LLaVA-NeXT-Interleave-Qwen-7B
Backbone=LLaVA-NeXT-In...
2026.05
59.14
Feedback
Search any
task
Search any
task