Real-world perception-centric reasoning on V* (test)
[Chart: Accuracy. Best result: GLM-9B-DeltaThinker, 84.25. Date shown: May 15, 2026.]
Updated 16d ago
Evaluation Results
Method                 Algorithm      Date      Accuracy
GLM-9B-DeltaThinker    DeltaPrompts   2026.05   84.25
Qwen-8B-DeltaThinker   DeltaPrompts   2026.05   82.73
Vision-R1-7B           -              2026.05   81.15
GLM-4.1V-9B-Thinking   -              2026.05   79.58
Qwen3-VL-8B-Thinking   -              2026.05   76.96
ARES-RL-7B             -              2026.05   71.2
REVisual-R1            -              2026.05   69.11
Bee-8B-RL              -              2026.05   58.12