High-Resolution Visual Reasoning on HR-Bench 4K

83.4 Accuracy

GazeVLM (Ours)

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
GazeVLM (Ours) 2026.05		83.4
Qwen3-VL-4B 2026.05		79.5
GazeVLM (w/o gaze bias) 2026.05		79
Qwen3.5-4B-Inst 2026.05		78.6
DeepEyes 2026.05		75.1
Ground-R1-7B 2026.05		75
MGPO 2025.07		74.2
Pixel-Reasoner-7B 2026.05		74
MGPO 2025.07		70.9
SPARC-4B 2026.05		70.5
GRPO 2025.07		69.8
Qwen2.5-VL-7B 2026.05		68.5
GRPO 2025.07		67.9
Gemini2.5-Flash-Lite 2026.05		67.2
GPT-4o 2026.05		59