Agentic Reasoning on VisualProbe Medium
[Chart: Accuracy by date. Top entry: DeepEyes-7B, 84.7 accuracy (Nov 25, 2025).]
Updated 26d ago
Evaluation Results
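| Method        | Setting                 | Date    | Accuracy |
|---------------|-------------------------|---------|----------|
| DeepEyes-7B   | Activation Replay=true  | 2025.11 | 84.7     |
| DeepEyes-7B   | Activation Replay=false | 2025.11 | 82.1     |
| Qwen2.5-VL-7B | Activation Replay=false | 2025.11 | 72       |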