Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Offline Reinforcement Learning on D4RL MuJoCo (Random, Med-Replay, Medium, Expert Configurations)

1,020.7Total Score

CABI+TD3_BC

427.9581.8735.7889.6Feb 13, 2022Mar 5, 2022Mar 26, 2022Apr 15, 2022May 6, 2022May 26, 2022Jun 16, 2022
Updated 4d ago

Evaluation Results

MethodLinks
1,020.715.111.96.444.431.329.445.1100.482105112.7108.4107.6112.4108.6
2022.06
979.310.2111.443.331.425.242.899.579.797.9112.2101.1105.7112.2105.7
2022.06
974.632.211.40.643.335.642.641.399.479.596.190.6103.6106.8112.379.9
2022.06
802.625.111.47.338.633.719.241.752.159.153.496.340.1108.2110.3106.1
2022.02
773---------------
2022.06
764.321.710.72.741.928.615.837.244.257.527.1111.4-82.4111.2103.8
2022.02
698.5---------------
2022.02
692.4---------------
2022.02
684.6---------------
2022.02
677.4---------------
2022.02
672.6---------------
2022.06
618.62.39.83.838.9188.437.430.317.440.695.414.8104109.188.4
2022.06
595.329.51.234.719.78.336.63011.467.689.612105.2111.556
2022.02
466.7---------------
2022.02
450.7---------------
2022.06
-35.411.713.653.167.53942.32817.863.323.744.6---
2022.06
-2.210.64.938.233.11540.754.553.164.7110.957.5---