Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Error Detection on CoSPlan Blocks-World-E
Loading...
44.5
Accuracy
CoG-VLM
25.364
30.332
35.3
40.268
Dec 11, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
CoG-VLM
Reasoning Strategy=Sce...
2025.12
44.5
CoG-VLM
Reasoning Strategy=Cha...
2025.12
43.1
GPT-4o
Reasoning Strategy=Sce...
2025.12
42.1
CoG-VLM
Reasoning Strategy=Van...
2025.12
41.3
Intern-VLM
Reasoning Strategy=Cha...
2025.12
37.9
Intern-VLM
Reasoning Strategy=Sce...
2025.12
37.3
Intern-VLM
Reasoning Strategy=Van...
2025.12
36.5
Qwen2 VL-8B
Reasoning Strategy=Sce...
2025.12
35.2
GPT-4o
Reasoning Strategy=Cha...
2025.12
35.1
Qwen2 VL-8B
Reasoning Strategy=Van...
2025.12
32.3
Janus-pro-7B
Reasoning Strategy=Cha...
2025.12
31
Qwen2 VL-8B
Reasoning Strategy=Cha...
2025.12
30.6
Janus-pro-7B
Reasoning Strategy=Van...
2025.12
29.3
Janus-pro-7B
Reasoning Strategy=Sce...
2025.12
27.6
Random
2025.12
26.1
Feedback
Search any
task
Search any
task