Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Visual Understanding on AI2D
Loading...
83.89
Accuracy
IREASONER
82.1116
82.5733
83.035
83.4967
Jan 9, 2026
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
IREASONER
Reward Type=Continuous...
2026.01
83.89
EvoLMM
2026.01
83.41
Qwen2.5-VL-7B w/ Discrete Reward + Step-level Majority
Reward Type=Discrete,...
2026.01
82.95
Vision-Zero
external supervision=true
2026.01
82.64
Qwen2.5-VL-7B (Baseline)
Backbone=Qwen2.5-VL-7B
2026.01
82.61
Qwen2.5-VL-7B w/ Discrete Reward
Reward Type=Discrete
2026.01
82.18
Feedback
Search any
task
Search any
task