Multi-modal Image Understanding on MMBench
[Chart: Accuracy over time (x-axis: Oct 27, 2025 to Mar 26, 2026). Highlighted result: MHRoPE, 80.78.]
Updated 12d ago
Evaluation Results
| Method | Configuration (truncated in source) | Date | Accuracy |
| --- | --- | --- | --- |
| MHRoPE | Backbone=Qwen3-VL-8B-I... | 2025.10 | 80.78 |
| HoPE | Backbone=Qwen3-VL-8B-I... | 2025.10 | 79.59 |
| MRoPE-I | Backbone=Qwen3-VL-8B-I... | 2025.10 | 79.5 |
| Vanilla RoPE | Backbone=Qwen3-VL-8B-I... | 2025.10 | 79.29 |
| CircleRoPE | Backbone=Qwen3-VL-8B-I... | 2025.10 | 79 |
| VideoRoPE | Backbone=Qwen3-VL-8B-I... | 2025.10 | 78.74 |
| MRoPE | Backbone=Qwen3-VL-8B-I... | 2025.10 | 78.27 |
| InternVL3.5 + MoE-GRPO (ours) | Arch.=MoE, # activated... | 2026.03 | 77.5 |
| InternVL3.5 + Stoch-FT-Noise | Arch.=MoE, # activated... | 2026.03 | 76.3 |
| InternVL3.5 + Det-FT | Arch.=MoE, # activated... | 2026.03 | 75.8 |
| InternVL3.5 + Stoch-FT-Multi | Arch.=MoE, # activated... | 2026.03 | 73.9 |
| Mini-InternVL1.5 | Arch.=Dense, # activat... | 2026.03 | 70.9 |
| InternVL2.5 | Arch.=Dense, # activat... | 2026.03 | 70.7 |
| InternVL2 | Arch.=Dense, # activat... | 2026.03 | 65.4 |
| LLaVA-OV | Arch.=Dense, # activat... | 2026.03 | 52.1 |