Offline Reinforcement Learning on D4RL Walker2d Stochastic MuJoCo (Mixed)
[Chart: Mean Return over time. Plotted point: CODAC-C, 450 (Jul 12, 2021). Available metrics: Mean Return, CVaR 0.1.]
Evaluation Results
| Method | Objective | Date | Mean Return | CVaR 0.1 |
|---|---|---|---|---|
| CODAC-C | CVaR (risk-sensitive) | 2021.07 | 450 | 261.4 |
| CODAC-N | Risk-neutral | 2021.07 | 358.7 | 106.4 |
| ORAAC | - | 2021.07 | 222 | -69.6 |
| CQL | - | 2021.07 | 74.3 | -64 |
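
The two reported metrics can be illustrated with a minimal sketch. It assumes that "CVaR 0.1" means the average of the worst 10% of evaluation episode returns, which is the usual reading of the term; this page does not define it. The function names and the sample returns below are hypothetical and are not benchmark data.

```python
import math


def mean_return(returns):
    """Average episode return over all evaluation episodes."""
    if not returns:
        raise ValueError("returns must be non-empty")
    return sum(returns) / len(returns)


def cvar(returns, alpha=0.1):
    """Empirical lower-tail CVaR: mean of the worst ceil(alpha * n) returns.

    Assumption: this tail-average estimator matches the leaderboard's
    "CVaR 0.1" column; the page itself does not specify the estimator.
    """
    if not returns:
        raise ValueError("returns must be non-empty")
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must be in (0, 1]")
    k = max(1, math.ceil(alpha * len(returns)))  # size of the worst-case tail
    worst = sorted(returns)[:k]                  # lowest returns first
    return sum(worst) / k


# Hypothetical returns for 20 evaluation episodes (made up for illustration).
episode_returns = [float(10 * i) for i in range(1, 21)]

print("Mean Return:", mean_return(episode_returns))
print("CVaR 0.1:", cvar(episode_returns, alpha=0.1))
```

Under this reading, a method can post a high Mean Return and still a low or negative CVaR 0.1 if a small fraction of its episodes go badly, which is why the table reports both columns.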