Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Move to Aim on Physical Simulator Out-of-Domain evaluation
Loading...
82.1
Success Rate
RL w. World Model
7.74
27.045
46.35
65.655
Dec 3, 2025
Success Rate
Updated 4d ago
Evaluation Results
Method
Method
Links
Success Rate
RL w. World Model
Policy=OpenVLA
2025.12
82.1
RL w. World Model
Policy=MLP
2025.12
72
RL w. ManiSkill
Policy=MLP
2025.12
42.9
RL w. ManiSkill
Policy=OpenVLA
2025.12
41.5
Supervised Fintune
Policy=OpenVLA
2025.12
24.5
Supervised Fintune
Policy=MLP
2025.12
10.6
Feedback
Search any
task
Search any
task