Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Constrained Offline Reinforcement Learning on DSRL HalfCheetahVelocity

105Normalized Return

BCQ-Lag

25.9646.486787.52Dec 9, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
10518.21
2025.12
1000.01
2025.12
980.77
2025.12
9713.1
2025.12
880.54
2025.12
650
2025.12
290.74