Coding Agent Is Good As World Simulator

About

World models have emerged as a powerful paradigm for building interactive simulation environments, with recent video-based approaches demonstrating impressive progress in generating visually plausible dynamics. However, because these models typically infer dynamics from video and represent them in latent states, they do not explicitly enforce physical constraints. As a result, the generated video rollouts are not physically plausible, exhibiting unstable contacts, distorted shapes, or inconsistent motion. In this paper, we present an agentic framework that constructs physics-based world models through executable simulation code. The framework coordinates planning, code generation, visual review, and physics analysis agents. The planning agent converts the natural language prompt into a structured scene plan, the code agent implements it as executable simulation code, the visual review agent provides visual feedback, and the physics analysis agent checks physical consistency. The code is iteratively revised based on this feedback until the simulation matches the prompt requirements and physical constraints. Experimental results show that our framework outperforms advanced video-based models in physical accuracy, instruction fidelity, and visual quality, and that it could be applied to scenarios including driving simulation and embodied robot tasks.

Hongyu Wang, Jingquan Wang, Bocheng Zou, Radu Serban, Dan Negrut • 2026
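
The plan, generate, review, and revise loop described in the abstract can be sketched roughly as follows. This is only an illustrative outline, not the authors' implementation: every name here (`build_world_model`, `Feedback`, the `demo_*` agents, `max_rounds`) is hypothetical, the agents are plain Python callables standing in for LLM-backed components, and no simulator is actually run.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Optional, Tuple


@dataclass
class Feedback:
    """Result of one review pass (hypothetical structure)."""
    ok: bool
    notes: List[str] = field(default_factory=list)


def build_world_model(
    prompt: str,
    plan_agent: Callable[[str], dict],
    code_agent: Callable[[dict, Optional[str], List[str]], str],
    visual_agent: Callable[[str, str], Feedback],
    physics_agent: Callable[[dict, str], Feedback],
    max_rounds: int = 5,  # assumed revision budget, not taken from the paper
) -> Tuple[str, bool, int]:
    """Return (code, accepted, number_of_revisions)."""
    plan = plan_agent(prompt)          # prompt -> structured scene plan
    code = code_agent(plan, None, [])  # plan -> first draft of simulation code
    for revisions in range(max_rounds + 1):
        visual = visual_agent(prompt, code)  # does it match the prompt?
        physics = physics_agent(plan, code)  # is it physically consistent?
        if visual.ok and physics.ok:
            return code, True, revisions
        if revisions == max_rounds:
            break
        # Revise the code using feedback from both reviewers.
        code = code_agent(plan, code, visual.notes + physics.notes)
    return code, False, max_rounds


# Toy stand-ins for the four agents, used only to exercise the loop.
def demo_plan(prompt: str) -> dict:
    return {"prompt": prompt, "objects": ["vehicle", "terrain"]}


def demo_code(plan: dict, previous: Optional[str], notes: List[str]) -> str:
    if previous is None:
        return "sim v0"
    return previous + "".join(f" +{n}" for n in notes)


def demo_visual(prompt: str, code: str) -> Feedback:
    ok = "camera" in code
    return Feedback(ok, [] if ok else ["camera"])


def demo_physics(plan: dict, code: str) -> Feedback:
    ok = "contact" in code
    return Feedback(ok, [] if ok else ["contact"])


if __name__ == "__main__":
    print(build_world_model("a vehicle drives over terrain",
                            demo_plan, demo_code, demo_visual, demo_physics))
```

In this sketch both reviewers run on every round and their notes are merged before the next revision. Whether the actual framework reviews in parallel, in sequence, or with a different stopping rule is not stated in the abstract.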

Related benchmarks

| Task | Dataset | Metric | Result | Rank |
|---|---|---|---|---|
| World Modeling | WorldModelBench Aggregated across three scenarios | Instruction Score | 5.9 | 2 |
| World Modeling | WorldModelBench Vehicle FSI | Instruction Following Score | 2.9 | 2 |
| World Modeling | WorldModelBench Outdoor vehicle | INSTR Score | 1 | 2 |
| World Modeling | WorldModelBench Robot in office | Instruction Following Score | 2 | 2 |
| World Modeling | WorldModelBench Vehicle FSI scenario (test) | Total Score | 6.8 | 2 |
| World Modeling | WorldModelBench Outdoor vehicle scenario (test) | Total Score | 5.8 | 2 |
| World Modeling | WorldModelBench Robot in office scenario (test) | Total Score | 6.9 | 2 |
