Reasoning on MME-r reasoning
[Chart: Score over time. Top entry: Qwen3-VL-8B-Thinking, Score 722.5, Feb 18, 2026]
Updated 3d ago
Evaluation Results
Method                 Method details            Links    Score
Qwen3-VL-8B-Thinking   Backbone=Qwen3-VL-8B,...  2026.02  722.5
SAP                    Backbone=Qwen3-VL-8B,...  2026.02  689.9
Qwen3-VL-8B-Instruct   Backbone=Qwen3-VL-8B,...  2026.02  656.7