Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Real-World Understanding on Tree Bench
Loading...
42.5
Accuracy
Qwen2.5-VL
36.78
38.265
39.75
41.235
Nov 7, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-VL
Tool=✗, Param Size=32B
2025.11
42.5
DeepEyesV2
Tool=General, Param Si...
2025.11
42.5
Pixel-Reasoner
Tool=Crop, Param Size=7B
2025.11
39
InternVL3
Tool=✗, Param Size=8B
2025.11
38.8
DeepEyes
Tool=Crop, Param Size=7B
2025.11
37.5
LLaVA-OV
Tool=✗, Param Size=7B
2025.11
37.3
Qwen2.5-VL
Tool=✗, Param Size=7B
2025.11
37
Feedback
Search any
task
Search any
task