Share your thoughts, 1 month free Claude Pro on usSee more

Constrained Offline Reinforcement Learning on DSRL CarCircle1

0.73Normalized Return

BCQ-Lag

Updated 5mo ago

Evaluation Results

Method	Links
BCQ-Lag 2025.12		0.73	5.25
BC-All 2025.12		0.72	4.39
COptiDICE 2025.12		0.7	5.72
CDT 2025.12		0.6	1.73
MPDiffuser 2025.12		0.5	0.85
BC-Safe 2025.12		0.37	1.38
CPQ 2025.12		0.02	2.29