Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Knowledge QA on CDDMBench
Loading...
88.5
QA Accuracy
Qwen-VL-Chat-AG* (7B)
25.06
41.53
58
74.47
Jan 8, 2026
QA Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
QA Accuracy
Qwen-VL-Chat-AG* (7B)
Method=SFT (Frozen enc...
2026.01
88.5
Gpt-5-Nano
Method=+Judge
2026.01
84.5
Qwen-VL-Chat-AG (7B)
Method=SFT (Unfrozen e...
2026.01
84
Gpt-5-Nano
Method=Expl. Caption
2026.01
84
Qwen2.5-VL-3B-Instruct
Method=Reasoning-Enhan...
2026.01
84
Gpt-5-Nano
Method=+Few-shot
2026.01
76
Qwen2.5-VL-3B-Instruct
Method=GRPO
2026.01
72.49
Gpt-5-Nano
Method=Zero-shot
2026.01
65
Qwen2.5-VL-3B-Instruct
Method=SFT
2026.01
63
Qwen-VL-Chat (7B)
Method=+Judge
2026.01
51
Qwen-VL-Chat (7B)
Method=+Few-shot
2026.01
50
Qwen-VL-Chat (7B)
Method=Expl. Caption
2026.01
46.5
Qwen2.5-VL-3B-Instruct
Method=Few-shot
2026.01
45.5
Qwen-VL-Chat (7B)
Method=Zero-shot
2026.01
41
Qwen2.5-VL-3B-Instruct
Method=Zero-shot
2026.01
27.5
Feedback
Search any
task
Search any
task