Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Error Detection on CoSPlan Robo-VQA-E
Loading...
45.3
Accuracy
GPT-4o
7.652
17.426
27.2
36.974
Dec 11, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
GPT-4o
Reasoning Strategy=Cha...
2025.12
45.3
GPT-4o
Reasoning Strategy=Sce...
2025.12
44.2
CoG-VLM
Reasoning Strategy=Sce...
2025.12
35.3
CoG-VLM
Reasoning Strategy=Cha...
2025.12
33.4
CoG-VLM
Reasoning Strategy=Van...
2025.12
32.1
Janus-pro-7B
Reasoning Strategy=Sce...
2025.12
26.1
Intern-VLM
Reasoning Strategy=Sce...
2025.12
26.1
Random
2025.12
25.4
Intern-VLM
Reasoning Strategy=Cha...
2025.12
25.2
Intern-VLM
Reasoning Strategy=Van...
2025.12
24.3
Janus-pro-7B
Reasoning Strategy=Cha...
2025.12
18.1
Janus-pro-7B
Reasoning Strategy=Van...
2025.12
17.5
Qwen2 VL-8B
Reasoning Strategy=Sce...
2025.12
9.6
Qwen2 VL-8B
Reasoning Strategy=Van...
2025.12
9.2
Qwen2 VL-8B
Reasoning Strategy=Cha...
2025.12
9.1
Feedback
Search any
task
Search any
task