Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Real-world perception-centric reasoning on RealWorldQA (test)
Loading...
77.04
Accuracy
GLM-9B-DeltaThinker
63.8528
67.2764
70.7
74.1236
May 15, 2026
Accuracy
Updated 16d ago
Evaluation Results
Method
Method
Links
Accuracy
GLM-9B-DeltaThinker
Algorithm=DeltaPrompts
2026.05
77.04
Qwen-8B-DeltaThinker
Algorithm=DeltaPrompts
2026.05
75.82
Qwen3-VL-8B-Thinking
2026.05
73.07
GLM-4.1V-9B-Thinking
2026.05
72.19
Bee-8B-RL
2026.05
72.03
Vision-R1-7B
2026.05
67.58
ARES-RL-7B
2026.05
66.67
REVisual-R1
2026.05
64.36
Feedback
Search any
task
Search any
task