Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Downstream model-based control on Walker2d OpenAI Gym (test)
Loading...
2,134.6
Accumulated Reward
Ours
38.5632
582.7266
1,126.89
1,671.0534
Mar 15, 2026
Accumulated Reward
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accumulated Reward
Ours
Pretrain=true
2026.03
2,134.6
Trajworld
Pretrain=true
2026.03
1,933.52
Ours
Pretrain=false
2026.03
707.61
Trajworld
Pretrain=false
2026.03
395.23
TDM
Pretrain=true
2026.03
207.61
MLPEnsemble
Pretrain=true
2026.03
190.51
TDM
Pretrain=false
2026.03
122.65
MLPEnsemble
Pretrain=false
2026.03
119.18
Feedback
Search any
task
Search any
task