Scene Reasoning on 3D Navigation Evaluation Suite
[Chart: Visual Consistency, Dynamic Feasibility, and Task Completion scores by method. Highlighted point: Wan 2.2, Visual Consistency 100, Feb 10, 2026.]
Updated 1mo ago
Evaluation Results
Method        Details                     Date      Visual Consistency   Dynamic Feasibility   Task Completion
Wan 2.2       Category=Open-source,...    2026.02   100                  100                   93
Wan 2.6       Category=Closed, Sampl...   2026.02   100                  100                   87
LVP           Category=Open-source,...    2026.02   93                   100                   33
Cosmos 2.5    Category=Open-source,...    2026.02   60                   92                    20
Hunyuan 1.5   Category=Open-source,...    2026.02   47                   100                   37