
Offline Preference-Based Reinforcement Learning on Meta-World

63.2 Lever Pull Success Rate

OPRL

[Chart: Lever Pull Success Rate, Feb 19, 2026]
Updated 13d ago

Evaluation Results

Method  Links
2026.02   63.2    3.5   77.4   10.6    0.8    7.4   63.5   34.3   37.1    6.8   30.4
2026.02   51.8     79   79.9   59.1   17.7  102.2     88   45.4   71.6   78.5   65.3
2026.02   51.7   12.4   37.3    1.8    1.1    3.4   84.3   41.5    9.2      8     25
2026.02   49.2   16.8    4.9   16.7    1.1   74.8     82   51.3    9.8      8   31.4
2026.02   33.1   67.4   79.6   30.7     14   89.2   75.8   44.3   63.1     73     57