Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Offline Reinforcement Learning on D4RL Cheetah Stochastic MuJoCo (Mixed)
Loading...
396.4
Mean Return
CODAC-C
206.808
256.029
305.25
354.471
Jul 12, 2021
Mean Return
CVaR 0.1
Updated 4d ago
Evaluation Results
Method
Method
Links
Mean Return
CVaR 0.1
CODAC-C
objective=CVaR
2021.07
396.4
238.5
CODAC-C
Objective=Risk-sensitive
2021.07
396.4
238.5
CODAC-N
objective=risk-neutral
2021.07
347.7
149.2
CODAC-N
Objective=Risk-neutral
2021.07
347.7
149.2
ORAAC
2021.07
307.1
118.9
ORAAC
2021.07
307.1
118.9
CQL
2021.07
214.1
12
CQL
2021.07
214.1
12
Feedback
Search any
task
Search any
task