| Dataset Name | SOTA Method | Metric | Value | Trend | Updated |
|---|---|---|---|---|---|
| WOMD (val) | EvoQRE | NLL (bits) | 2.83 | 8 | 4d ago |
| WOMD Sim Agents 2024 | SMART | Realism Score | 75.11 | 7 | 4d ago |
| Waymo Open Sim Agents Challenge 2025 | UniMotion | Realism Score | 78.51 | 6 | 4d ago |
| PhyGenBench PhysGen scenes | | ECMS | 0.54 | 3 | 4d ago |