World Modeling on WorldModelBench (Aggregated Scenarios)
[Chart: Instruction Score. Highlighted point: Multi-Agent Framework, 5.9 (May 14, 2026). Selectable metrics: Instruction Score, Physical Laws Score, Common Sense Score, Difference, p-value.]
Updated 19d ago
Evaluation Results
Method                 Date     Instruction Score  Physical Laws Score  Common Sense Score  Difference  p-value
Multi-Agent Framework  2026.05  5.9                11.3                 2.3                 -           -
Wan2.2-TI2V-5B         2026.05  3.5                10.9                 2.3                 -           -