Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PhysFlow: Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation

About

Realistic simulation of dynamic scenes requires accurately capturing diverse material properties and modeling complex object interactions grounded in physical principles. However, existing methods are constrained to basic material types with limited predictable parameters, making them insufficient to represent the complexity of real-world materials. We introduce PhysFlow, a novel approach that leverages multi-modal foundation models and video diffusion to achieve enhanced 4D dynamic scene simulation. Our method utilizes multi-modal models to identify material types and initialize material parameters through image queries, while simultaneously inferring 3D Gaussian splats for detailed scene representation. We further refine these material parameters using video diffusion with a differentiable Material Point Method (MPM) and optical flow guidance rather than render loss or Score Distillation Sampling (SDS) loss. This integrated framework enables accurate prediction and realistic simulation of dynamic interactions in real-world scenarios, advancing both accuracy and flexibility in physics-based simulations.

Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang• 2024

Related benchmarks

TaskDatasetResultRank
System IdentificationSynthetic dataset
RE1
50
System IdentificationSynthetic dataset
Rel Error (delta_mu)0.004
12
System IdentificationCholecSeg8K cutting
Registration Error (RE)0.196
5
Physically-grounded Video GenerationPhysically-grounded Video Evaluation Set Human Designed, Real World, and AI Generated scenes
OC17.96
5
System IdentificationPorcineEndo gallbladder
RE0.296
5
System IdentificationPorcineEndo stomach
Residual Error (RE)0.668
5
Physical Realism AssessmentSurgical Video Dataset (EndoNeRF, CholecSeg8K, and PorcineEndo) (test)
Physical Realism Score3.26
5
System IdentificationEndoNeRF v01_080
RE3.836
5
System IdentificationEndoNeRF v01_240
RE3.29
5
System IdentificationCholecSeg8K pulling
RE (Error Rate)4.651
5
Showing 10 of 20 rows

Other info

Code

Follow for update