Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-image Visual Question Answering on Mantis
Loading...
76.5
Accuracy
DelimScaling
74.0976
74.7213
75.345
75.9687
Feb 2, 2026
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
DelimScaling
Backbone=InternVL3, Mo...
2026.02
76.5
DelimScaling
Backbone=Qwen2.5-VL, M...
2026.02
75.58
Baseline
Backbone=InternVL3, Mo...
2026.02
74.65
Baseline
Backbone=Qwen2.5-VL, M...
2026.02
74.19
Feedback
Search any
task
Search any
task