Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Hallucination Reasoning on CounterCurate
Loading...
85.3
Accuracy
Argos
59.404
66.127
72.85
79.573
Dec 3, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Argos
2025.12
85.3
Video-R1
Training=RL
2025.12
63.6
Qwen2.5VL-7B
2025.12
61.4
Video-R1
Training=SFT
2025.12
60.6
Qwen2.5VL-7B
Chain-of-Thought (CoT)...
2025.12
60.4
Feedback
Search any
task
Search any
task