
Offline Reinforcement Learning on D4RL (MuJoCo M/MR/ME & Adroit/Kitchen Subset)

Walker2d (Medium Expert) Score: 113.3

ICQL

[Chart: Walker2d (Medium Expert) score; y-axis ticks 68.996, 80.498, 92, 103.502; x-axis label Sep 28, 2025]
Updated 1mo ago

Evaluation Results

Method  Links
2025.09
113.380.381.911162.696.489.145.944.785.689.43.74.517.111.779.359.561.569.7
2025.09
109.871.56198.563.382.483.442.538.989.584.97.20.59.87.659.253.345.863.2
2025.09
109.58581.9109.892.3100.193.647.944.666.267.518.413.40.13.722.53037.562
2025.09
109.27741.578.253.559.462.843.141.864.676.81.51.80.2-0.157.553.546.747.7
2025.09
109.282.839.498.760.687.484.644.636.991.568.91.10.2-0.196047.562.560
2025.09
107.868.967.2109.462.481.690.94238.917.832.40.71.313.22.432.554.153.857.2
2025.09
107.575.32652.552.918.155.242.636.663.9371.20.6206551.53847.3
2025.09
102.834.146.64464.618.694.248.240.555.754.81.22.20.74.427.56052.547.7
2025.09
10172.460.960.155.65592.943.94061.223.51.11.70.20.116.34515.850.5
2025.09
98.779.277.2105.4589562.444.445.537.539.24.42.19.90.143.85149.858.5
2025.09
70.770.254.857.557.165.870.842.839.579.5741.73.75.53.252.5605556.2