Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multimodal Reasoning on SPD-Faith Bench Easy 1.0
Loading...
5
Contradiction Rate
GPT-4o
3.62
12.935
22.25
31.565
Feb 8, 2026
Contradiction Rate
Updated 3d ago
Evaluation Results
Method
Method
Links
Contradiction Rate
GPT-4o
Model Type=Proprietary
2026.02
5
GLM-4.5V
Model Type=Open-Source
2026.02
8
Qwen3-VL-235B-A22B
Model Type=Open-Source...
2026.02
10.5
Qwen3-VL-32B
Model Type=Open-Source...
2026.02
16
Gemini-2.5-Pro
Model Type=Proprietary
2026.02
16.5
Claude-4.5-Haiku
Model Type=Proprietary
2026.02
16.5
Qwen2.5-VL-72B
Model Type=Open-Source...
2026.02
19
InternVL2.5-38B
Model Type=Open-Source...
2026.02
19.2
Qwen2.5-VL-7B
Model Type=Open-Source...
2026.02
23
MiniCPM-V-2.6
Model Type=Open-Source
2026.02
23
InternVL2.5-8B
Model Type=Open-Source...
2026.02
32.5
DeepSeek-VL2
Model Type=Open-Source
2026.02
39.5
Feedback
Search any
task
Search any
task