Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
World modeling on WorldModelBench Robot in office
Loading...
2
Instruction Following Score
Multi-Agent Framework
1.584
1.692
1.8
1.908
May 14, 2026
Instruction Following Score
Physics Adherence Score
Common-Sense Reasoning Score
Updated 19d ago
Evaluation Results
Method
Method
Links
Instruction Following Score
Physics Adherence Score
Common-Sense Reasoning Score
Multi-Agent Framework
2026.05
2
3.8
1.1
Wan2.2-TI2V-5B
2026.05
1.6
3.5
1
Feedback
Search any
task
Search any
task