Share your thoughts, 1 month free Claude Pro on us
See more
Feedback
Search any
task
Search any
task
SOTA World Modeling benchmarks and papers with code | Wizwand
Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Tasks
World Modeling
Benchmarks
Dataset Name
SOTA Method
Dataset Name
SOTA Method
Metric
Trend
Results
Last Updated
WorldPrediction-WM
VL-JEPA SFT
Accuracy
65.7
20
3mo ago
ScienceWorld
NeSyS
Matter Score
52.8
20
3mo ago
Plancraft
NeSyS
Smelt
98.4
20
3mo ago
Webshop (test)
Symbolic WM
Search
100
20
3mo ago
WorldArena (test)
Wan2.6
Image Quality
67.36
15
26d ago
nuScenes
GEM
CD (Inner)
0.3
12
23d ago
KITTI Odometry
GEM
CD Inner
0.13
10
23d ago
WorldArena
CtrlWorld
EWMScore
59.7
7
14d ago
WorldScore
FantasyWorld-1.0
Dynamic Score
71.39
7
1mo ago
1-min World Modeling Benchmark Hard-Trajectory
Infinite-World
R Score
41.31
6
19d ago
1-min world modeling benchmark Simple-Trajectory
SANA-WM + refiner
R (Trajectory Fidelity)
4.5
6
19d ago
60-second benchmark (Hard-Trajectory)
SANA-WM + refiner
PSNR
14.8
6
19d ago
60-second benchmark (Simple-Trajectory)
LingBot-World
PSNR
14.59
6
19d ago
Manhattan taxi rides
GPT
Next-Token Test Accuracy
100
5
9d ago
Crafter-OO
ONELIFE
Rank @ 1
18.7
5
1mo ago
Robosuite Push In-Distribution (test)
OrbiSim
PSNR (10 frames)
26.7105
4
15d ago
WorldScore
DreamWorld
3D Consistency
73.16
4
3mo ago
River Raid (test)
Finite Automata Extraction
FID
0.13
3
12d ago
River Raid (train)
Finite Automata Extraction
FID
0.14
3
12d ago
Pac-Man (test)
Game Engine Learning
FID
0.23
3
12d ago
Pac-Man (train)
Game Engine Learning
FID
0.19
3
12d ago
MIND First 50 (test)
Matrix-Game 2.0
Context Memory
11.88
3
3mo ago
MIND-Third 50 1.0 (test)
Matrix-Game 2.0
Long Context Memory
0.1404
3
3mo ago
Robosuite Push Out-of-Distribution (test)
OrbiSim
PSNR (10 steps)
27.1867
2
15d ago
WorldModelBench Robot in office scenario (test)
Multi-Agent Framework
Total Score
6.9
2
19d ago
Showing 25 of 31 rows
25 / page
50 / page
100 / page
1
2
Search any
task
Search any
task
Privacy Policy
Terms of Service
FAQs
Swarm Docs