Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Constrained Offline Reinforcement Learning on DSRL CarCircle1
Loading...
0.73
Normalized Return
BCQ-Lag
-0.0084
0.1833
0.375
0.5667
Dec 9, 2025
Normalized Return
Cost
Updated 1mo ago
Evaluation Results
Method
Method
Links
Normalized Return
Cost
BCQ-Lag
2025.12
0.73
5.25
BC-All
2025.12
0.72
4.39
COptiDICE
2025.12
0.7
5.72
CDT
2025.12
0.6
1.73
MPDiffuser
num_samples=16
2025.12
0.5
0.85
BC-Safe
2025.12
0.37
1.38
CPQ
2025.12
0.02
2.29
Feedback
Search any
task
Search any
task