Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Safe Reinforcement Learning on Bullet Safety Gym

0.73Normalized Reward

BCQ-Lag

0.3140.4220.530.638Feb 9, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
0.733.11
2026.02
0.643.36
2026.02
0.630.68
2026.02
0.610.77
2026.02
0.610.56
2026.02
0.542.55
2026.02
0.520.82
2026.02
0.483.81
2026.02
0.390.03
2026.02
0.331.15