Vision-Centric Perception on RealWorldQA
Chart: Accuracy over time. Top result: 69.3 (QWEN2.5-VL-7B + PGT), May 22, 2026.
Evaluation Results
Method                           Configuration              Date     Accuracy
QWEN2.5-VL-7B + PGT              Backbone=QWEN2.5-VL-7B...  2026.05  69.3
QWEN2.5-VL-7B + SPECIALIZED MIX  Backbone=QWEN2.5-VL-7B...  2026.05  68.9
INTERNVL3-8B + PGT               Backbone=INTERNVL3-8B,...  2026.05  68.5
IMAGE JIGSAW                     Method=IMAGE JIGSAW        2026.05  68.5
QWEN2.5-VL-7B                    Backbone=QWEN2.5-VL-7B     2026.05  67.5
THINKLITE-VL                     Method=THINKLITE-VL        2026.05  67.5
INTERNVL3-8B                     Backbone=INTERNVL3-8B      2026.05  65.2
QWEN2.5-VL-3B + PGT              Backbone=QWEN2.5-VL-3B...  2026.05  62.9
QWEN2.5-VL-3B + SPECIALIZED MIX  Backbone=QWEN2.5-VL-3B...  2026.05  62.7
LLAVA-NEXT-7B + PGT              Backbone=LLAVA-NEXT-7B...  2026.05  60.1
LLAVA-NEXT-LLAMA3-8B             Backbone=LLAVA-NEXT-LL...  2026.05  59.5
LLAVA-NEXT-LLAMA3-8B + PGT       Backbone=LLAVA-NEXT-LL...  2026.05  59.5
QWEN2.5-VL-3B                    Backbone=QWEN2.5-VL-3B     2026.05  59
LLAVA-NEXT-7B                    Backbone=LLAVA-NEXT-7B     2026.05  58.4
SPATIAL-LADDER-3B                Method=SPATIAL-LADDER-3B   2026.05  52.5
VIGORL-3B                        Method=VIGORL-3B           2026.05  47.3